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PLASMIN-RESISTANT STREPTOKINASE 
Background of the Inventilon 
Streptokinase (SK) , isolated from Group c 
5 streptococcus, is used as a plasminogen activator to 
accelerate the lysis of the coronary thrombi that cause 
heart attacks. However, SK is by itself inert and must 
combine with human plasminogen to form a catalytically- 
active SK-plasminogen activator complex (SK-PAC) which 
10 cleaves substrate plasminogen molecules. Studies of 
proteolytic fragments of SK and recombinant truncation 
mutants have defined regions of SK which are important 
for binding interactions with plasminogen in the 
construction of the activator complex. Through undefined 
15 molecular interactions, an active site appears in the 
plasminogen moiety of the SK-PAC (Buck et al., 1968, J. 
Biol. Chem. 246:209-246). The SK-PAC then generates the 
active enzyme plasmin by clipping substrate plasminogen 
molecules at the Arg560-Val bond (Robbins et al., 1987, 
20 In Colman et al., Hemostasis and thrombosis: basic 

principles and clinical practice, 2nd ed. , Lippincott, 
Philadelphia, pp. 341-357) . 

Almost immediately after forming an active SK-PAC, 
the SK moiety is clipped to smaller molecular weight 
25 forms (Siefring and Castellino, 1976, j. Biol. Chem. 
251:3913-3920; Markus et al., 1976, J. Biol. Chem. 
251:6495-6504). Cleavage of SK markedly reduces the 
catalytic activity of the activator complex (Markus et 
al., 1976, supra). Enzymatic studies of SK fragments 
30 isolated after reacting with plasminogen at lower 
temperatures suggests that SK activity declines with 
progressive cleavage (Markus et al., 1976, supra). 

Inactivation of SK in plasma as a result of 
plasmin cleavage reduces the therapeutic effectiveness of 
35 this plasminogen activation. 
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Suimarv of tihe InventilQn 
The SK-derived compounds of the invention resist 
cleavage inactivation by plasmin, while retaining all or 
a substantial portion of the plasminogen-binding and 
5 catalytic activity of native SK. SK modified according 
to the invention is a more potent thrombolytic agent than 
native SK, and therefore, is a more useful therapeutic 
tool. 

The invention features a compound containing (a) a 
10 plasminogen-binding fragment of SK and (b) a blocking 

group at the amino-terminus of the fragment. By the term 
"streptokinase" is meant an indirect plasminogen 
activator derived from streptococci. By the term 
"fragment" is meant a polypeptide containing less than or 
15 all of the native, full-length amino acid sequence of SK. 
SK may be recombinant or purified from streptococci, and 
the streptococci from which it is derived is preferably 
^-hemolytic. Alternatively, the streptokinase may be 
derived from an a-hemo lytic streptococci. The 
20 streptococci from which SK is derived is preferably from 
Group C, e.g.. Streptococcus eguisimilus, however SK may 
also be derived from streptococci of Group A or Group G. 

The compound is catalytically active and the rate 
of in vitro degradation in the presence of human 

25 plasminogen is at least two times slower than the rate of 
native, full-length mature SK protein derived from 
Streptococcus eguisimilus (nSK) , i.e., the time required 
from the addition of SK to plasminogen to the 
disappearance of the band on a Western blot corresponding 

30 to the uncleaved nSK. For example, the time required for 
the disappearance of uncleaved nSK is about 2 min. , 
whereas the time for the disappearance of modified SK 
ranges from 7 min. to greater than 20 min. By the term 
"catalytically active" is meant it possesses the ability 
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of SK to interact with plasminogen to form a SK-PAC 
capable of activating plasminogen to plasmin. By the 
term "degradation" is meant the process by which SK is 
reduced by plasmin cleavage into lower molecular weight 
5 fragments. The rate of degradation is measured by the 
disappearance of a full-length recombinant SK as detected 
by immunoblotting using anti-SK antibodies. 

The compound preferably contains the amino acid 
sequence of SEQ ID NO: 4. The blocking group of the 

10 compound may be a peptide or a non-peptide blocking group 
which is located at the amino-terminus of the SK 
fragment. For example, a blocking group may be 
introduced by glycosylation or myristolization. 
Preferably, the blocking group is least one heterologous 

15 amino acid; more preferably, the blocking group is a 

heterologous peptide of two or more amino acids; and most 
preferably, the blocking group is a fragment of or all of 
maltose binding protein (MBP) . By the term 
"heterologous" is meant an addition or substitution of 

20 one or more amino acids that is different from that found 
at the corresponding site in nSK. 

The invention also includes a DNA, e.g., a DNA 
vector, containing a coding sequence which encodes the 
polypeptide portion of the compound of the invention, and 

25 a method of dissolving blood clots in a mammal by 

administering an effective amount of the compound. An 
effective amount of the compound is an amount which is 
effective in dissolving at least one blood clot in a 
patient. 

30 The invention also features a plasminogen-binding 

fragment of SK which is catalytically active and the rate 
of in vitro degradation of which is at least two times 
slower than the rate of nSK in the presence of human 
plasminogen. The fragment preferably comprises at least 

35 95% of the amino acid sequence of nSK; more preferably. 
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the fragment lacks one to five amino-terminal amino acids 
of nSK; more preferably, the fragment lacks one to ten 
amino-terminal amino acids; more preferably, the fragment 
lacks 1-24 amino acids. In a preferred embodiment, the 
5 fragment consists of amino acids 14-414 of nSK (SEQ ID 
NO: 4). A fragment consisting of amino acids 14-414 of 
nSK (SEQ ID NO: 4) may also contain at least one or more 
mutations selected from the group consisting of K36A, 
R45A, K51A, K59A, K61A, K147A, K333, R232A, K257A, K298A, 
10 K309A, R234A, R363A, K386A, K372A, R388A, R394A, and 
R401A. 

The invention also includes an SK polypeptide 
which is catalytically active and the rate of in vitro 
degradation of which is at least two times slower 

15 compared to the rate of nSK. By "polypeptide" is meant a 
chain of amino acids, regardless of length or post- 
translational modification (e.g., glycosylation or 
phosphorylation) . Preferably, the polypeptide consists 
of the amino acid sequence of nSK in which at least one 

20 potential plasmin cleavage site has been mutated to 
render it resistant to plasmin cleavage. More 
preferably, the polypeptide contains one or more 
mutations selected from the group consisting of RIGA, 
K36A, R45A, K51A, K59A, K61A, K147A, K333, R232A, K257A, 

25 K298A, K309A, R234A, R363A, K386A, K372A, R388A, R394A, 

and R401A. Most preferably, the fragment is rSKSmut (SEQ 
ID NO: 17), which contains the mutations, RlOA, R36A, 
R45A, R51A, and R59A or rSKSmut, which contains the 
mutations RlOA, R36A, R45A, R51A, R59A, and K386A (SEQ ID 
30 NO: 18). The invention also includes a DNA containing a 
coding sequence encoding the SK polypeptide of the 
invention and a method of dissolving blood clots in a 
mammal by administering to the mammal an effective amount 
of the SK polypeptide of the invention. 
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Other features and advantages of the invention 
will be apparent from the following description of the 
preferred embodiments thereof, and from the claims. 



Detailed Description 

5 The drawings will first be described. 

Drawings 

Fig. 1 is a photograph of a Western blot showing 
purification of a fusion protein with maltose binding 
protein linked to the amino terminus nSK (rSK) , a fusion 
10 protein with MBP linked to the amino terminus of nSK in 
which the amino terminal 13 amino acids of nSK were 
deleted (rSKAl4) , and rSKSmut. 

Fig. 2 is a graph showing plasminogen activation 
by nSK, rSK, and rSKAl4. 

15 Fig. 3 is a photograph of a Western blot showing 

plasmin cleavage of nSK. 

Fig. 4 is a photograph of a Western blot showing 
plasmin cleavage of rSK (0-20 min.). 

Fig. 5 is a photograph of a Western blot showing 
20 plasmin cleavage of rSKAl4. 

Fig. 6 is a photograph of a Western blot showing 
plasmin cleavage of rSKSmut. 

Fig. 7 is a photograph of a Western blot showing 
comparative plasmin cleavage of rSK, rSKAl4, nSK, and 
25 rSKSmut. 



Modification of SK to render it resistant to degradation 
bv plasmin 

Within seconds, binding of SK to plasminogen to 
form SK-PAC, nSK is rapidly degraded at its amino 
30 terminus by plasmin. Through the process of degradation, 
plasmin limits the throinbo lytic efficacy of nSK. 
According to the invention, SK can be modified in three 
different ways to render it resistant to plasmin 
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Cleavage: (i) by blocking the amino terminus of nSK, 
e.g., with a heterologous peptide; (2) by deleting one or 
more amino terminal amino acids from nSK; and (3) by 
altering plasmin cleavage sites throughout nSK to render 
5 them resistant to plasmin cleavage. in one example, 

a recombinant fusion protein was made in which the amino 
terminus of nSK was tethered in peptide linkage to MBP 
(rSK) . In another example, a recombinant fusion protein 
was made in which the MBP was linked to the amino 
10 terminus of nSK, the first 13 amino acids of which were 
deleted. In the third example, the nSK amino acid 
sequence was mutated at plasmin-cleavage sites to render 
those sites resistant to enzymatic cleavage, e.g., in the 
mutant rSKSmut, the K or R residue in five potential 
15 plasmin cleavage sites were changed to A residues. In 
each case, plasmin cleavage yielded catalytically active 
plasmin cleavage products, but the rate of degradation 
was markedly reduced compared to that of nSK. In 
addition to affecting the rate of degradation, mutation 
20 of plasmin cleavage sites also significantly decreases 
the of amidolytic activity, which leads to greater 
catalytic efficiency. 

Therapeutic Applications 

The compounds of the invention can be used to lyse 
25 blood clots in a mammal. The compounds can be 
administered by any standard route including 
intraperitoneal ly, intramuscularly, subcutaneous ly, or 
intravenously. It is expected that the preferred route 
of administration will be intravenous. The compounds can 
30 be administered systemically to the bloodstream as well 
as locally within the blood vessel at the site of clot 
formation. Since the compounds of the invention are 
timied-release, they can be administered in a single dose 
rather than by continuous infusion. 
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As is well known in the medical arts, dosages for 
any one patient depends on many factors, including the 
patients general health, sex, size, body surface area, 
age, as well as the particular compound to be 
5 administered, time and route of administration, and other 
drugs being administered concurrently. Dosages for the 
compounds of the invention will vary, but a preferred 
dosage for administration to human patients is 
approximately 20,000 units per kg of body weight (units 

10 of SK are defined in Bulletin. World. Health. Org., 1965, 
33:235). Determination of correct dosage for a given 
application is well within the abilities of one of 
ordinary skill in the art of pharmacology. Optimal 
dosage may be adjusted according to the condition of the 

15 patient and response of the patient to therapy. 

EXAMPLE l! Modification of the amino terminus of 
streptokinase modul ates the appearance of the active site 
in the SK~PAC 

To examine the functional role of the amino 
20 terminus of SK in the SK-PAC, the amino terminus of SK 
was recombinantly modified by partial deletion of amino- 
terminal amino acids or by tethering of the amino 
terminus with a blocking group, e.g., a heterologous 
peptide. Functional activity of the modified SK was 
25 evaluated by measuring (1) the rate of plasminogen 

activation by SK-PAC, (2) the amidolytic activity of the 
SK-PAC, and (3) the plasmin-mediated degradation of SK in 
the SK-PAC. 

Cloning, Expression and Purification of Streptokinase - 
30 The SK gene (Malke et al., 1985, Gene 34:357-362) 

was cloned from Streptococcus equisimilis by the 
polymerase chain reaction (PGR), sequenced (U.S. 
Biochemicals, Cleveland, Ohio; Sanger et al., 1977, Proc. 
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Natl. Acad. Sci USA 74:5463) and subcloned into the pMAL 
vector for bacterial expression (New England Biolabs, 
Beverly, MA) using known methods, e.g.. Reed et al., 
1993, J. Immunol. 150:4407-4415; Reed et al., 1993, 
5 Circulation 88: Abstract 1-615). The expressed SK gene 
formed a fusion protein with maltose binding protein at 
its amino terminus (rSK) . Restriction digestion of the 
SK gene with Hinc II removed the nucleotides encoding the 
amino tejrminal 13 amino acids of SK to produce deletion 
10 mutant, rSKAl4. These recombinant SK fusion proteins 
were purified by affinity chromatography on an amylose 
resin (New England Biolabs, Beverly, MA) as described by 
the supplier. The purity of the recombinant SK fusion 
proteins was assessed by SDS-PAGE (Laemmli, 1970, Nature 
15 227:680-685). For some experiments, the SK fusion 
proteins were cut with factor Xa (Maina et al., 1988, 
Gene 74:365) and the MBP portion of the fusion protein 
removed by affinity chromatography on an amylose resin. 

After purification, the relative concentrations of 
20 the recombinant SKs were determined by comparative 
radioimmunoassay (RIA) using anti-SK monoclonal 
antibodies. Wells of a microtiter plate were coated with 
various concentrations of nSK (0, 2.5, 5, 10, 20, and 40 
Mg/mL) or different dilutions of the recombinant SKs, 
25 rSKAl4 and rSK5mut. After nonspecific binding sites had 
been blocked with i% bovine serum albumin, anti-SK 
monoclonal antibodies were added to each well in 
duplicate. After a 1-h incubation, the wells were washed 
and probed with ^^Sj g^^^ anti-mouse antibody (Cappel 
30 Organon Teknika, Durham, NC) for l h. After another 
wash, the amount of bound antibody was determined by 
gamma counting. A standard curve relating antibody 
binding (cpm) to nSK concentration was derived and the 
concentration of each recombinant SK was determined by 
35 reference to the standard curve. 
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Plasminogen Activali ion bv recombinant SKs. 

Studies of the time-related activity of different 
SKs were carried out by mixing Glu-plasminogen (333 nM; 
American Diagnostica, Greenwich, CT) in a quartz cuvette 
5 with S2251 (0.5 mM; H-D-valyl-L-leucyl-L-lysine-p- 
nitroanilide dihydrochloride, Chromogenix, Sweden) at 
21<>C or 370c and then adding purified nSK, rSK, or rSKAl4 
(16.7 nM) . Absorption at 405 nanometers was continuously 
monitored in a Hewlett-Packard diode array 
10 spectrophotometer . 

Active Site Titration 

The development of an active site in the SK-PAC 
was monitored using standard methods. Plasminogen (8.5 
Mg; Sigma, St. Louis, MO) was added to a quartz cuvette 

15 containing 2 ml of filtered buffer (50 mM, 100 mM NaCl, 
pH 7.4) and 1 mM of the fluorogenic substrate 
4-methylumbelliferyl p-guanidinobenzoate (Sigma, St. 
Louis, MO) thermostatically maintained at 25 ^C. The 
emission at 445 nanometers (excitation at 365 nanometers) 

20 was continuously monitored in a Hitachi 2000 fluorescence 
spectrophotometer. After -200 seconds of observation, 
rSK was added, and the reaction was recorded for a total 
of 2000 seconds. 

Kinetic Assays of the SK-PAC 

25 The amidase kinetic parameters of nSK, rSK and 

rSKAl4 were studied using a paranitroanilide substrate 
(S2251 , H-D-valyl-L-leucyl-L-lysine-p-nitroanilide 
dihydrochloride, Chromogenix, Sweden) using known 
methods, e.g., Wohl R. et al., 1980., Biochim. et 

30 Biophys. Acta 745:20-31). The recombinant SK proteins 
and Glu-plasminogen were mixed together and incubated for 
5 min. (nSK and rSK) or 20 min (rSKAl4) at 37**C. The 
mixture was then transferred to a quartz cuvette 
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containing assay buffer (50 mM Tris, lOO mM NaCl, pH 7.4) 
and various concentrations of S2251 (100-800 iM) added. 
The cuvette was thermostatically regulated at 37<»C. The 
change in absorbance was monitored at 404 nM for 10 min. 
s at 37«C, and the data were transformed to Linewaever- 
Burke plots to determine the and V^^j^. 

Studies of the decfradation n f SK bv plasm in 

The time-related proteolysis of nSK, rSK, rSKAl4, 
and rSKSmut was studied by immunoblotting . nSK (1 fig) or 

10 recombinant SKs (2 fig) were mixed together with purified 
human 61u-plasminogen (40 ngs; American Diagnositica, 98% 
Glu-type plasminogen) for 0-20 min. The amount of human 
plasminogen present is typically in excess of the amount 
of SK. At various time points, an aliquot (5 fil) was 

15 removed and plunged into boiling water to stop the 

reaction. The samples were then electrophoresed on 10% 
SDS-polyacrylamide under reducing conditions and 
electrophoretically transferred to polyvinyl idene 
dif luoride membranes (Millipore, Bedford, MA) . 

20 Nonspecific binding sites were blocked with 5% nonfat 
milk for 1 hr. The blots were incubated with pooled 
monoclonal antibodies specific for SK overnight at 4«>c. 
The blots were washed and incubated for 1 hr. with I25j_ 
goat antimouse antibody (-1,000,000 cpm; Cappel Organon 

25 Teknika, Durham, NC) which had been labelled using the 
lodogen labelling method known in the art. After 
washing, the blots were exposed to Kodak X-O-mat film 
(Rochester, NY) at -70<»C. 



I 
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Amino-terminal modification ot SK 

SK was produced as a fusion protein with MBP at 
its aroino terminus (rSK) , the amino acid sequence of 
which is shown in Table 1. A mutant lacking the first 13 
amino acids of SK was also produced as a fusion protein 
(rSKAl4) , the amino acid sequence of which is shown in 
Table 2. The amino acid sequence of nSK is shown in 
Table 3, and the amino acid sequence of SKa14 is shown in 
Table 4* The sequence of both rSK and rSKAl4 suggested 
that they could be cleaved at the fusion protein junction 
by factor Xa. The production of the rSK proteins in E. 
coli was induced by IPTG. Recombinant SK proteins were 
purified from bacterial lysates by affinity 
chromatography. As shown in Fig. i, the proteins 
migrated at the predicted molecular size (rSK: 89 kDa, 
rSKAl4: 87 kDa) . 

Table 1: rSK 

MKTEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDG 

PDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEAL 

SLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFK 

YENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTIN 

GPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYL 

LTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWY 

AVRTAVINAASGRQTVDEALKDAQTNSSSVPGRGSIEGRIAGPEWLLDRPSVNNSQL 

WSVAGTVEGTNQDISLKFFEIDLTSRPAHGGKTEQGLSPKSKPFATDSGAMSHKLE 

KADLLKAIQEQLIANVHSNDDYFEVIDFASDATITDRNGKVYFADKDGSVTLPTQPV 

QEFLLSGHVRVRYKEKPIQNQAKSVDVEYTVQFTPLNPDDDFRPGLKDTKLLKTLAI 

GDTITSQELLAQAQSILNKNHPGYTIYERDSSIVTHDNDIFRTILPMDQEFTYRVKN 

REQAYRINKKSGLNEEINNTDLISEKYYVLKKGEKPYDPFDRSHLKLFTIKYVDVDT 

NELLKSEQLLTASERNLDFRDLYDPRDKAKLLYNNLDAFGIMDYTLTGKVEDNHDDT 

NRIITVYMGKRPEGENASYHLAYDKDRYTEEEREVYSYLRYTGTPIPDNPNDKNNSQ 

LWSVAGTVEGTNQDISLKFFEIDLTSRPAHGGKTEQGLSPKSKPFATDSGAMSHKL 

EKADLLKAIQEQLIANVHSNDDYFEVIDFASDATITDRNGKVYFADKDGSVTLPTQP 

VQEFLLSGHVRVRYKEKPIQNQAKSVDVEYTVQFTPLNPDDDFRPGLKDTKLLKTLA 
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IGDTITSQELLAQAQSILNKNHPGYTIYERDSSIVTHDNDIFRTILPMDQEFTYRVK 

NREQAYRINKKSGLNEEINNTDLISEKYYVLKKGEKPYDPFDRSHLKLFTIKYVDVD 

TNELLKSEQLLTASERNLDFRDLYDPRDKAKLLYNNLDAFGIMDYTLTGKVEDNHDD 

TNRIITVYMGKRPEGENASYHLAYDKDRYTEEEREVYSYLRYTGTPIPDNPNDK 
(SEQ ID NO:l) 

Table 2; rSKAl4 

MKTEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKPPQVAATGD6 

PDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEAL 

SLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAAD6GYAFK 

YEN6KYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTIN 

GPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYL 

LTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWY 

AVRTAVINAASGRQTVDEALKDAQTNSSSVPGRGSIEGRNNSQLWSVAGTVEGTNQ 

DISLKFFEIDLTSRPAHGGKTEQGLSPKSKPFATDSGAMSHKLEKADLLKAIQEQLI 

ANVHSNDDYFEVIDFASDATITDRNGKVYFADKDGSVTLPTQPVQEFLLSGHVRVRY 

KEKPIQNQAKSVDVEYTVQFTPLNPDDDFRPGLKDTKLLKTLAIGDTITSQELLAQA 

QSILNKNHPGYTIYERDSSIVTHDNDIFRTILPMDQEFTYRVKNREQAYRINKKSGL 

NEEINNTDLISEKYYVLKKGEKPYDPFDRSHLKLFTIKYVDVDTNELLKSEQLLTAS 

ERNLDFRDLYDPRDKAKLLYNNLDAFGIMDYTLTGKVEDNHDDTNRIITVYMGKRPE 

GENASYHLAYDKDRYTEEEREVYSYLRYT6TPIPDNPNDKNNSQLWSVAGTVEGTN 

QDISLKFFEIDLTSRPAHGGKTEQGLSPKSKPFATDSGAMSHKLEKADLLKAIQEQL 

lANVHSNDDYFEVIDFASDATITDRNGKVYFADKDGSVTLPTQPVQEFLLSGHVRVR 

YKEKPIQNQAKSVDVEYTVQFTPLNPDDDFRPGLKDTKLLKTLAIGDTITSQELLAQ 

AQSILNKNHPGYTIYERDSSIVTHDNDIFRTILPMDQEFTYRVKNREQAYRINKKSG 

LNEEINNTDLISEKYYVLKKGEKPYDPFDRSHLKLFTIKYVDVDTNELLKSEQLLTA 

SERNLDFRDLYDPRDKAKLLYNNLDAFGIMDYTLTGKVEDNHDDTNRIITVYMGKRP 

EGENASYHIiAYDKDRYTEEEREVYSYLRYTGTPIPDNPNDK (SEQ ID NO: 2) 

Table 3; nSK 

lAGPEWLLDRPSVNNSQLWSVAGTVEGTNQDISLKFFEIDLTSRPAHGGKTEQGLS 
PKSKPFATDSGAMSHKLEKADLLKAIQEQLIANVHSNDDYFEVIDFASDATITDRNG 
KVYFADKDGSVTLPTQPVQEFLLSGHVRVRYKEKPIQNQAKSVDVEYTVQFTPLNPD 
DDFRPGLKDTKLLKTLAIGDTITSQELLAQAQSILNKNHPGYTIYERDSSIVTHDND 
IFRTILPMDQEFTYRVKNREQAYRINKKSGLNEEINNTDLISEKYYVLKKGEKPYDP 
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FDRSHLKLFTIKyVDVDTNELLKSEQLLTASERNIJ)FIUDLYDPRDKAKIJ,YlWLDAF 

GIMDYTLTGKVEDNHDDTNRIITVYMGKRPEGENASYHLAYDKDRYTEEEREVYSYL 

RYTGTPIPDNPNDKNNSQLWSVAGTVEGTNQDISLKFFEIDLTSRPAHGGKTEQGL 

SPKSKPPATDSGAMSHKLEKADLLKAIQEQLIANVHSNDDYFEVIDFASDATITDRN 

GKVYFADKDGSVTIiPTQPVQEFLLSGHVRVRYKEKPIQNQAKSVDVEYTVQFTPLNP 

DDDFRPGLKDTKLIJCTIAIGDTITSQEIJjAQAQSILNKNHPGYTIYERDSSIVTHDN 

DIFRTILPMDQEFTYRVKNREQAYRINKKSGLNEEINNTDLISEKYYVLKKGEKPYD 

PFDRSHLKLFTIKYVDVDTNELLKSEQLLTASERNLDFRDLYDPRDKAKLLYNNLDA 

FGIMDYTLTGKVEDNHDDTNRIITVYMGKRPEGENASYHLAYDKDRYTEEEREVYSY 
LRYTGTPIPDNPNDK (SEQ ID NO: 3) 

Table 4; SKa14 

NNSQLWSVAGTVEGTNQDISLKFFEIDLTSRPAHGGKTEQGLSPKSKPFATDSGAM 

SHKLEKADLLKAIQEQLIANVHSNDDYFEVIDFASDATITDRN6KVYFADKDGSVTL 

PTQPVQEFIiSGHVRVRYKEKPIQNQAKSVDVEYTVQFTPLNPDDDFRPGLKDTKLL 

KTLAIGDTITSQELIiAQAQSILNKNHPGYTIYERDSSIVTHDNDIFRTILPMDQEFT 

YRVKNREQAYRINKKSGLNEEINNTDLISEKYYVLKKGEKPYDPFDRSHLKLFTIKY 

VDVDTNELLKSEQLLTASERNLDFRDLYDPRDKAKLLYNNLDAFGIMDYTLTGKVED 

NHDDTNRIITVYMGKRPEGENASYHLAYDKDRYTEEEREVYSYLRYTGTPIPDNPND 

KNNSQLWSVAGTVEGTNQDISLKFFEIDLTSRPAHGGKTEQGLSPKSKPFATDSGA 

MSHKLEKADLLKAIQEQLIANVHSNDDYFEVIDFASDATITDRNGKVYFADKDGSVT 

LPTQPVQEFLLSGHVRVRYKEKPIQNQAKSVDVEYTVQFTPLNPDDDFRPGLKDTKL 

LKTLAIGDTITSQELLAQAQSILNKNHPGYTIYERDSSIVTHDNDIPRTILPMDQEF 

TYRVKNREQAYRINKKSGLNEEINNTDLISEKYYVLKKGEKPYDPFDRSHLKLFTIK 

YVDVDTNELLKSEQLLTASERNLDFRDLYDPRDKAKLLYNNLDAFGIMDYTLTGKVE 

DNHDDTNRIITVYMGKRPEGENASYHLAYDKDRYTEEEREVYSYLRYTGTPIPDNPN 
DK (SEQ ID NO: 4} 

Functional acti vity of recombinant SKs 

To compare the function of SK, rSK and rSKAl4, the 
rate of plasminogen activation by these proteins was 
examined at 21 »C. nSK rapidly activated plasminogen with 
a minimal lag phase, i.e., less than 50 sec. (see Fig. 
2) . However, when expressed as a fusion protein, rSK 
showed a lag phase in plasminogen activation of 
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approximately 150 sec. (see Fig. 2). When expressed as a 
fusion protein lacking the amino terminal 13 amino acids, 
rSKAl4 also showed a marked delay in time to activation 
OS approximately 250 sec. (see Fig. 2). The lag phase 
5 refers to the time required for the reaction to the 
exponential phase of activity, e.g, full catalytic 
activity. 



Plasmin cleavage products 

Since nSK is known to be cleaved by plasmin after 

10 formation of the SK-PAC, the rate of cleavage of rSK and 
rSKAl4 was examined after various times of incubation 
with Glu-plasminogen. In these experiments, SK was mixed 
with an excess of plasminogen for various amounts of time 
and the resulting cleavage of SK was determined by 

15 immunoblotting with monoclonal anti-SK antibodies. nSK 
was found to be rapidly degraded by plasmin within 30 
sees to four lower molecular weight species, 
predominantly a -36 kDa fragment (see Figs. 3 and 7). In 
contrast, the degradation of rSK was slower, yielding a 

20 fragment of 47 kDa (identical in size to nSK) , first 
appearing at 1 min. A pattern of smaller SK fragments 
similar to that observed with nSK developed thereafter. 
After 5 min., a -36 kDa SK fragment similar to that seen 
after nSK cleavage was found to be the major remnant from 

25 rSK (see Figs. 4 and 7) . Other lower molecular weight SK 
fragments, e.g., -28 kDa, were also evident as cleavage 
products of nSK, and at later time points, of rSK. 
Plasmin cleavage products of rSKAl4 are shown in Fig. 5. 

Amino-terminal deletion of SK 
30 The amino terminal 13 residues of SK are highly 

conserved among the SKs produced by different groups of 
streptococci. In addition, this region constitutes a 
major epitope for both murine and human antibodies 
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against SK. Removal of the amino-terminal 13 amino acids 
from nSK resulted in a further increase in the lag phase 
of plasminogen activation by rSKAl4, as compared to rSK. 
This lag phase was marked at 21<>C, but shortened 
5 significantly when the temperature was raised to 370c. 
Active site titration experiments indicate that removal 
of the amino terminus further delays the generation of 
the active site in the rSKAl4. 

Advantages of amin o-terminal Iv modified SK 

At 37<»C, and in vivo, nSK rapidly forms an active 
site with plasminogen. The kinetics of this activation 
has been regarded as suboptimal for therapy because 
plasmin is rapidly activated in one large burst in vivo. 
To overcome the explosive activation of plasminogen, an 

15 acylated SK-PAC (APSAC) made from SK and purified human 
plasminogen has been created in vitro (Ferres, 1987, 
Drugs 33 (Suppl. 3) 33). This approach permits APSAC to 
be given as a single bolus in vivo because continuous 
deacylation of the active site proceeds with a half-life 

20 of 40 mins (Staniforth et al., 1983, Eur. J. Clin. 
Pharmacol. 24:751). A limitation of this approach is 
that the rate of appearance of the active SK-PAC is 
determined by the rate of deacylation and can not be 
otherwise modulated. 

25 In contrast, recombinant modification of the amino 

terminus of SK, either by expression as a fusion protein, 
or by deletion of the amino terminus, can predictably 
alter the rate of active site generation. For example, 
the extent to which the rate of degradation is reduced 

30 compared to nSK is directly proportional to the number of 
deleted amino-terminal amino acids (up to 13 amino 
acids) . Other advantages of the SK-derived compounds of 
the invention include a short half-life: 2-4 min.; 
safety: the compounds of the invention are not made from 
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human blood products; and cost-effectiveness: the 
compounds of the invention are recombinantly produced. 
The activity of the compounds is timed-released, 
therefore they can be administered in a single dose. The 
5 time required to achieve SK activity may also be modified 
depending on the number of amino-terminal amino acids 
removed from the nSK, i.e., length of time required is 
directly proportional to the number of amino acids 
deleted. In this manner, the timed-release activity of 

10 SK can be customized to suit the specific clinical 
application or patient to be treated. Thus, the 
compounds of the invention are improved clinical reagents 
because, using modified rSKs, an active SK-PAC can be 
generated at a rate consistent with best thrombolytic 

15 results . 

EXAMPLE 2; Site-dir ected streptokinase mutants resist 

cleavage and degradation bv plasmin 

To examine the effects of cleavage on the activity 
of SK, site-directed mutations of R or L to A at putative 
20 plasmin or trypsin cleavage sites in the amino and 

car boxy terminus of SK were generated. The cleavage rate 
of these recombinant SKs were then examined. The 
catalytic function of rSKs with these specific mutations 
was also evaluated. 



25 SK clon ing and mutation bv overlap extension 

The SK gene was cloned from Group c Streptococcus 
eguisimilis as described above. A series of mutations 
was performed in the amino terminus of SK to replace R or 
K residues with an A residue at putative plasmin cleavage 

30 sites. In addition, a single K to A mutation was 

constructed for K386 in the carboxy terminus of SK. PGR 
primers were used to produce site-directed mutations by 
the overlap extension method. For example, using nSK in 
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the pMAL vector as a template^ PGR was performed using a 
primer corresponding to the mal E sequence of the pMALc 
vector and the SK 10 AS primer. At the same time the SK 
10 S primer was used in a PGR reaction with a SK 36 AS 
5 primer* The PGR products were purified on a low-melt 
agarose gel and used in an overlap PGR reaction. The 
overlapped product was then further amplified using the 
mal E primer and the SK 36 AS primer. In a similar 
fashion, the primers were used to construct mutations at 

10 the 45 and 51 position. The final overlap construct was 
between the 5' overlapped mutated SK segment containing 
the mutations at SK 10, 36, 45, and 51 and the segment 
from 51 to 127. This overlapped fragment was then 
ligated into the pMALc nSK, replacing the wild type 

15 sequence, between restriction sites for ICpnl and Aflll. 
The SK 59 mutation was separately constructed and used to 
replace the wild type sequence between ATI II and Muni. 
The mutation at residue 386 was similarly constructed and 
ligated into SK using a Hindlll site. The mutated 

20 pMALcSKs were sequenced to verify the desired mutations. 
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Table 5. Primers for Muiial^ion by Overlap Rvt-g^ng^ 



25 



Primer Mutation 
Restriction Site 
SK 10 S R->A 



SK 10 AS 



SK 36 S K->A 



10 SK 36 AS 



SK 45 S R->A 



SK 45 AS 

15 



SK 51 S K->A 



SK 51 AS 



20 SK 59 S K->A 



SK 59 AS 



SK 386S K->A 



Primer Secruence 

5 ' -GCTGCTAGACGCGCCATCTGTCAAC 
(SEQ ID NO: 5) 

5 ' -TGGCGCGTCTA6CAGCCACTCAG 
(SEQ ID NO: 6) 

5 ' -CAAGACATTAGTCTG6CCTTTTTTGAAATCG 
(SEQ ID NO: 7) 

5 ' -GGCCAGACTAATGTCTTGATTCG 
(SEQ ID NO:8} 

5 ' -CGATCTAACATCGGCGCCTGCTCATGG 
(SEQ ID NO: 9) 

5 ' -CGCCGATGTTAGATCGATTTC 
(SEQ ID NO: 10) 

5 ' -GCTCATGGAGGCGCCACAGAGGGC 
(SEQ ID NO: 11) 

5 ' -GGCGCCTCCATGAGCAGGTC 
(SEQ ID NO: 12) 

5 ' -GCTTAAGTCCGGCCTCAAAACCATTTGC 
(SEQ ID NO: 13) 

5 ' -TGAGGCCG6ACTTAAGCCTTGCTC 
(SEQ ID NO: 14) 

5 ' -^CCGATCGATATACCGAAGAAGAACGAG 
(SEQ ID NO: 15) 

5 ' -TATCGATCGGCATCATAGGCTAAATGATAGC 

(SEQ ID NO: 16) 



Hhal 



Haelll 



Narl 



Narl 



Haelll 



Clal 



30 



35 



Plasmin-resisiian t SK ai-hfi mutant.^ 

The following plasmin cleavage sites can be 
mutated: RIGA, K36A, R45A, K51A, K59A, K61A, K147A, K333, 
R232A, K257A, K298A, K309A, R234A, R363A, K386A, K372A, 
R388A, R394A, and R401A. Single mutants K59A, K386A, 
were made, and the multiple mutant containing RIGA, K36A, 
R45A, K51A, and K59A (rSKmutS) was studied further. 
Purification of rSKSmut is shown in Fig. i. Multiple 
mutant rSKSmut is identical to rSKSmut with the addition 
of another mutation at a car boxy- terminal potential 
plasmin cleavage site. This mutant contains the 
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following mutations: RlOA, K36A, R45A, K51A, K59A and 
k386A. 

The plasmin-resistant SK site mutants produce 
catalytically-active plasmin cleavage products which are 
larger than those generated from nSK (see Figs. 6 and 7) 
The rate of degradation of rSKSmut is also slower than 
that of nSK ( see Figs . 6 and 7 ) . 

Kinetic studies were performed to examine the 
catalytic activity of the site mutants. Table 6 shows 
the results from kinetic studies for rSKSmut and Glu- 
plasminogen. These data show that mutation of plasmin 
cleavage sites significantly decreases the of SK 
amidolytic activity leading to greater catalytic 
efficiency, and thus, greater therapeutic efficacy. 

Table 6: Kinetic Par ameters for recombinant SKs and 

Glu-Plasminoaen 



(MM) (S^^) 

nSK 248 56 0.226 

rSK 152 42 0.276 

rSKAl4 533 51 0.096 

rSKSmut 77 52 0.675 
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Table 7: rSKSmut 

MKTEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDG 

PDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEAL 

SLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFK 

YENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTIN 

GPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYL 

LTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWY 

AVRTAVINAASGRQTVDEALKDAQTNSSSVPGRGSIEGRIAGPEWLLDAPSVNNSQL 

WSVAGTVEGTNQDISLAFFEIDLTSAPAHGGATEQGLSPASKPFATDSGAMSHKLE 

KADLLKAIQEQLIANVHSNDDYFEVIDFASDATITDRNGKVYFADKDGSVTLPTQPV 

QEFLLSGHVRVRYKEKPIQNQAKSVDVEYTVQFTPLNPDDDFRPGLKDTKLLKTLAI 

GDTITSQELIAQAQSIUJKNHPGYTIYERDSSIVTHDNDIFRTILPMDQEFTYRVKN 

REQAYRINKKSGLNEEINNTDLISEKYYVLKKGEKPYDPFDRSHLKLFTIKYVDVDT 

NELLKSEQLLTASERNLDFRDLYDPRDKAKLLYNNLDAFGIMDYTLTGKVEDNHDDT 

NRIITVYMGKRPEGENASYHLAYDKDRYTEEEREVYSYLRYTGTPIPDNPNDKNNSQ 

LWSVAGTVEGTNQDISLKFFEIDLTSRPAHGGKTEQGLSPKSKPFATDSGAMSHKL 

EKADLLKAIQEQLIJ^NVHSNDDYFEVIDFASDATITDRNGKVYFADKDGSVTLPTQP 

VQEFLLSGHVRVRYKEKPIQNQAKSVDVEYTVQFTPLNPDDDFRPGLKDTKLLKTLA 

I6DTITSQELLAQAQSILNKNHPGYTIYERDSSIVTHDNDIFRTILPMDQEFTYRVK 

NREQAYRINKKSGLNEEINNTDLISEKYYVLKKGEKPYDPFDRSHLKLFTIKYVDVD 

TNELLKSEQLLTASERNLDFRDLYDPRDKAKLLYNNLDAFGIMDYTLTGKVEDNHDD 

TNRIITVYMGKRPEGENASYHLAYDKDRYTEEEREVYSYLRYTGTPIPDNPNDK 
(SEQ ID NO: 17) 
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Table 8! rSK6mu1i 

MKTEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDG 

PDIIFWAHDRFGGYAQSGLIAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEAL 

SLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFK 

YENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTIN 

GPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYL 

LTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWY 

AVRTAVINAASGRQTVDEALKDAQTNSSSVPGRGSIEGRIAGPEWLLDAPSVNNSQL 

WSVAGTVEGTNQDISLAFFEIDLTSAPAHGGATEQGLSPASKPFATDSGAMSHKLE 

KADLLKAIQEQLIANVHSNDDYFEVIDFASDATITDRNGKVYFADKDGSVTLPTQPV 

QEFLLSGHVRVRYKEKPIQNQAKSVDVEYTVQFTPLNPDDDFRPGLKDTKLLKTLAI 

GDTITSQELLAQAQSILNKNHPGYTIYERDSSIVTHDNDIFRTILPMDQEFTYRVKN 

REQAYRINKKSGLNEEINNTDLISEKYYVLKKGEKPYDPFDRSHLKLFTIKYVDVDT 

NELIJCSEQLLTASERNLDPRDLYDPRDKAKLLYNNLDAFGIMDYTLTGKVEDNHDDT 

NRIITVYMGKRPEGENASYHLAYDADRYTEEEREVYSYLRYTGTPIPDNPNDKNNSQ 

LWSVAGTVEGTNQDISLKFFEIDLTSRPAHGGKTEQGLSPKSKPFATDSGAMSHKL 

EKADLLKAIQEQLIANVHSNDDYFEVIDFASDATITDRNGKVYFADKDGSVTLPTQP 

VQEFLLSGHVRVRYKEKPIQNQAKSVDVEYTVQFTPLNPDDDFRPGLKDTKLLKTLA 

IGDTITSQELLAQAQSILNKNHPGYTIYERDSSIVTHDNDIFRTILPMDQEFTYRVK 

NREQAYRINKKSGLNEEINNTDLISEKYYVLKKGEKPYDPFDRSHLKLFTIKYVDVD 

TNELLKSEQIJjTASERNLDFRDLYDPRDKAKLLYNNLDAFGIMDYTLTGKVEDNHDD 

TNRIITVYMGKRPEGENASYHLAYDKDRYTEEEREVYSYLRYTGTPIPDNPNDK 
(SEQ ID NO: 18) 
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Table 9; DNA sequence of SK from S. eauisimilus H46A 

1 ctgcagctac ctgataccag gcatttccaa caaacatggt taaggccaaa 
51 ccaaaatcac tttctagcgt tggcaagaga ccttcaagcg agcgcaagac 
101 ctttattgaa gttgcttgtc gacataaaaa tgctgtttgg gttgtgctga 
151 taggcaaaat gacctcaagc cctgcaatca tctgctggag caactcaact 
201 aagtcagctg gtaaaacctg ctgatgattg aggtaaataa actgagaagt 
251 ctcaaacagc tgagggggat tgccctgatg atcaagcaaa taccgctgcc 
301 aaggtgaccc tagcggctgc aagacctcat attgacccaa ccccacctca 
351 agtaataagc gctctttttc ggataaacat gatttgggaa aatgcacata 
401 ttggtcccct tctttgacac tcacccactc tttatctcct aacggatgag 
451 ggcctacttg catctctgga aaatagtctt ttagctccat agccattcct 
501 ttcatgacgg tctttaaacc attataacac atgactcttt atcacacagt 
551 tcagtttgtt gtcagcacga ttttgtattt tctgcctttt taatcattaa 
601 aactaaataa gggttattca tttttagcaa gaacattcaa ttaaatagct 
651 atttatcgga atattaattt atgtttatgc taaaaaaggt attatttacc 
701 ttttttcatt gtcattaaaa tatcatttta aaaaaatcaa taggttttta 
751 tttgtgtctt taaaaccatt atgttattct aataatgggg attgaaactt 
801 aacttttagg aggtttctat gaaaaattac ttatcttttg ggatgtttgc 
851 actgctgttt gcactaacat ttggaacagt caattctgtc caagctattg 
901 ctggacctga gtggctgcta gaccgtccat ctgtcaacaa cagccaatta 
951 gttgttagcg ttgctggtac tgttgagggg acgaatcaag acattagtct 
1001 taaatttttt gaaatcgatc taacatcacg acctgctcat aggaaaga 
1051 cagagcaagg cttaagtcca aaatcaaaac catttgctac tgatagtggc 
1101 gcgatgtcac ataaacttga gaaagctgac ttactaaagg ctattcaaga 
1151 acaatt:gat:c gctaacgtcc acagtaacga cgactacttt gaggtcattg 
1201 attttgcaag cgatgcaacc attactgatc gaaacggcaa ggtctacttt 
1251 gctgacaaag atggttcggt aaccttgccg acccaacctg tccaagaatt 
1301 tttgctaagc ggacatgtgc gcgttagacc atataaagaa aaaccaatac 
1351 aaaaccaagc gaaatctgtt gatgtggaat atactgtaca gtttactccc 
1401 ttaaaccctg atgacgattt cagaccaggt ctcaaagata ctaagctatt 
1451 gaaaacacta gctatcggtg acaccatcac atctcaagaa ttactagctc 
1501 aagcacaaag cattttaaac aaaaaccacc caggctatac gatttatgaa 
1551 cgtgactcct caatcgtcac tcatgacaat gacattttcc gtacgatttt 
1601 accaatggat caagagttta cttaccgtgt taaaaatcgg gaacaagctt 
1651 ataggatcaa taaaaaatct ggtctgaatg aagaaataaa caacactgac 
1701 ctgatctctg agaaatatta cgtccttaaa aaaggggaaa agccgtatga 
1751 tccctttgat cgcagtcact tgaaactgtt caeca tcaaa tacgttgatg 
1801 -tcgataccaa cgaattgcta aaaagtgagc agctcttaac agctagcgaa 
1851 cgtaacttag acttcagaga tttatacgat cctcgtgata aggctaaact 
1901 actctacaac aatctcgatg cttttggtat tatggactat accttaactg 
1951 gaaaagtaga ggataatcac gatgacacca accgtatcat aaccgtttat 
2001 atgggcaagc gacccgaagg agagaatgct agctatcatt tagcctatga 
2051 taaagatcgt tataccgaag aagaacgaga agtttacagc tacctgcgtt 
2101 a-tacagggac acctatacct gataacccta acgacaaata accacggtct 
2151 -bctaaaacga tgagattaac tgacaaaaaa agcaagcaac atgctatcaa 
2201 cagttgcttg cttttttcta acctcttagt tgtagagact agtgacattt 
2251 cgtgtctaaa ataatcgtaa ctggtccatc attgatgaga ctaacctgca 
2301 tatctgcccc aaaaacgcca cgctcaactg gcacaaaatc tgccaattgt 
2351 tcattaaagc gatcataaaa ctggctagcc atatcagctt tgcagctcct 
2401 gtaaaggctg ggcgatttcc ctttttggtg tcagcataaa gggtaaattg 
2451 cgacacagat aagatactac ccttgatgtc ttggatagac tgattcatct 
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2501 tgccatcagc atctgaaaaa atgcgcatgt tgactatttt tgcacagcgt 
2551 aagccaaatc ttctgcag 
(SEQ ID NO: 19) 

SK coding sequence spans nucleotides 819-2138; coding 
sequence of mature peptide spans nucleotides 897-2138. 

Table 10 ; PNA sequence of MBP* 

atgaaaactg aagaaggtaa actggtaatc tggattaacg gcgataaagg 
ctataacggt ctcgctgaag tcggtaagaa attcgagaaa gataccggaa 
ttaaagtcac cgttgagcat ccggataaac tggaagagaa attcccacag 
gttgcggcaa ctggcgatgg ccctgacatt atcttctggg cacacgaccg 
ctttggtggc tacgctcaat ctggcctgtt ggctgaaatc accccggaca 
aagcgttcca ggacaagctg tatccgttta cctgggatgc cgtacgttac 
aacggcaagc tgattgctta cccgatcgct gttgaagcgt tatcgctgat 
ttataacaaa gatctgctgc cgaacccgcc aaaaacctgg gaagagatcc 
cggcgctgga taaagaactg aaagcgaaag gtaagagcgc gctgatgttc 
aacctgcaag aaccgtactt cacctggccg ctgattgctg ctgacggggg 
ttatgcgttc aagtatgaaa acggcaagta cgacattaaa gacgtgggcg 
tggataacgc tggcgcgaaa gcgggtctga ccttcctggt tgacctgatt 
aaaaacaaac acatgaatgc agacaccgat tactccatcg cagaagctgc 
ctttaataaa ggcgaaacag cgatgaccat caacggcccg tgggcatggt 
ccaacatcga caccagcaaa gtgaattatg gtgtaacggt actgccgacc 
ttcaagggtc aaccatccaa accgttcgtt ggcgtgctga gcgcaggtat 
taacgccgcc agtccgaaca aagagctggc gaaagagttc ctcgaaaact 
atctgctgac tgatgaaggt ctggaagcgg ttaataaaga caaaccgctg 
ggtgccgtag cgctgaagtc ttacgaggaa gagttggcga aagatccacg 
tattgccgcc accatggaaa acgcccagaa aggtgaaatc atgccgaaca 
tcccgcagat gtccgctttc tggtatgccg tgcgtactgc ggtgatcaac 
gccgccagcg gtcgtcagac tgtcgatgaa gccctgaaag acgcgcagac 
taattcgagc tcggtacccg gccggggatc catcgaggqt aqa 
(SEQ ID NO: 20) 

sequence represents cDNA sequence of MBP up to the 
restriction site in the polylinker where cDNA encoding SK 
was inserted. 



wo 96/41883 



PCT/US96/09640 



24 



SEQUENCE X.ISTXN6 



(1) GENERAL INFORMATION: 

(1) APPLICANTS President and Fellows of Harvard College 
(11) TITLE OF INVENTIONS PLASMIN-RESISTANT STREPTOKINASE 
(111) NUMBER OF SEQUENCES: 20 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson P.O. 

(B) STREET: 225 Franklin Street 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02110-2804 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US96/ 

(B) FILING DATE: 07-JUN-1996 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/488,940 

(B) FILING DATE: 09-JUN-1995 

(viii) ATTCnoVEY/AGENT INFORMATION: 

(A) NAME: Fraser, Janis K. 

(B) REGISTRATION NUMBER: 34,819 

(C) REFERENCE/DOCKET NUMBER: 05433/009W01 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617/542-5070 

(B) TELEFAX: 617/542-8906 

(C) TELEX: 200154 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1194 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
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Met Lys Xhr Glu Glu Gly Lys Leu Val lie Trp He Asn Gly Asp Lys 



15 



Gly Tyr Asn Gly i.eu Ala Glu Val Gly Lys Lys Phe Glu Lys Asp Thr 

25 30 



Oly lie Lys Val Thr Val Glu His Pro Asp Lys Leu Glu Glu Lys Phe 

^ 45 



Pro Gin val Ala Ala Thr Gly Asp Gly Pro Asp lie He Phe Trp Ala 
His ASP Arg Phe Gly Gly Tyr Ala Gin Ser Gly Leu Leu Ala Glu Xle 



75 80 



Thr Pro Asp Lys Ala Phe Gin Asp Lys Leu Tyr Pro Phe Thr Trp Asp 



90 95 



Ala Val Arg Tyr Asn Gly Lys Leu He Ala Tvr- ti 

100 Tnf Ala Val Glu 

110 

Ala Leu ser Leu Xle Tyr Asn Lys Asp Leu Leu Pro Asn Pro Pro Lys 

Thr Trp Glu Glu lie Pro Ala Leu Asp Lys Glu Leu Ly! Ala Lys Gly 

Lys ser Ala Leu Met Phe Asn Leu Gin Glu Pro Tyr Phe Thr Trp Pro 

160 

Leu lie Ala Ala Asp Gly Gly Tyr Ala Phe Lys Tyr Glu Asn Gly Lys 

170 275 
Tyr Asp He Lys Asp Val Gly Val Asp Asn Ala Gly Ala Lys Ala Gly 

las 290 
Leu Thr Phe Leu Val Asp Leu He Lys Asn Lys His Met Asn Ala Asp 

Thr ASP Tyr Ser He Ala Glu Ala Ala Phe Asn Lys Gly Glu Thr Ala 

220 

Met Thr He Asn Gly Pro Trp Ala Trp Ser Asn He Asp Thr Ser Lys 

235 240 
val Asn Tyr Gly Vai Thr Val Leu Pro Thr Phe Lys Gly Gin Pro Ser 

250 255 
Lys Pro Phe Val Gly Val Leu Ser Ala Gly He Asn Ala Ala Ser Pro 

^55 270 
Asn Lys Glu Leu Ala Lys Glu Phe Leu ri„ aor, m 

285 

Glu Gly Leu Glu Ala Val Asn Lys Asp Lys Pro Leu Gly Ala Val Ala 



300 



Leu Lys Ser Tyr Glu Glu Glu Leu Ala Lys Asp Pro Arg He Ala Ala 



320 



Thr Met Glu Asn Ala Gin Lys Gly Glu He Met Pro Asn He Pro Gin 

330 
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Met Ser Ala Phe Trp Tyr Ala Val Arg Thr Ala Val lie Asn Ala Ala 

340 345 350 

Ser Gly Arg Gin Thr Val Asp Glu Ala Leu Lys Asp Ala Gin Thr Asn 
355 360 365 

Ser Ser Ser Val Pro Gly Arg Gly Ser lie Glu Gly Arg lie Ala Gly 
370 375 380 

Pro Glu Trp Leu Leu Asp Arg Pro Ser Val Asn Asn Ser Gin Leu Val 
385 390 395 400 

Val Ser Val Ala Gly Thr Val Glu Gly Thr Asn Gin Asp lie Ser Leu 

405 410 415 

Lys Phe Phe Glu lie Asp Leu Thr Ser Arg Pro Ala His Gly Gly Lys 

420 425 430 

Thr Glu Gin Gly Leu Ser Pro Lys Ser Lys Pro Phe Ala Thr Asp Ser 
435 440 445 

Gly Ala Met Ser His Lys Leu Glu Lys Ala Asp Leu Leu Lys Ala He 
450 455 460 

Gin Glu Gin Leu He Ala Asn Val His Ser Asn Asp Asp Tyr Phe Glu 
465 470 475 480 

Val He Asp Phe Ala Ser Asp Ala Thr He Thr Asp Arg Asn Gly Lys 

485 490 495 

Val Tyr Phe Ala Asp Lys Asp Gly Ser Val Thr Leu Pro Thr Gin Pro 

500 505 510 

Val Gin Glu Phe Leu Leu Ser Gly His Val Arg Val Arg Tyr Lys Glu 
515 520 525 

Lys Pro He Gin Asn Gin Ala Lys Ser Val Asp Val Glu Tyr Thr Val 
530 535 540 

Gin Phe Thr Pro Leu Asn Pro Asp Asp Asp Phe Arg Pro Gly Leu Lys 
545 550 555 560 

Asp Thr Lys Leu Leu Lys Thr Leu Ala He Gly Asp Thr He Thr Ser 

565 570 575 

Gin Glu Leu Leu Ala Gin Ala Gin Ser He Leu Asn Lys Asn His Pro 

580 585 590 

Gly Tyr Thr He Tyr Glu Arg Asp Ser Ser He Val Thr His Asp Asn 
595 600 605 

Asp He Phe Arg Thr He Leu Pro Met Asp Gin Glu Phe Thr Tyr Arg 
610 615 620 

Val Lys Asn Arg Glu Gin Ala Tyr Arg He Asn Lys Lys Ser Gly Leu 
625 630 635 640 

Asn Glu Glu He Asn Asn Thr Asp Leu He Ser Glu Lys Tyr Tyr Val 

645 650 655 

Leu Lys Lys Gly Glu Lys Pro Tyr Asp Pro Phe Asp Arg Ser His Leu 

660 665 670 
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Lys Leu Phe Thr He Lys Tyr Val Asp Val Asp Thr Asn Glu Leu Leu 
675 680 685 

Lys Ser Glu Gin Leu Leu Thr Ala Ser Glu Arg Asn Leu Asp Phe Aro 
690 695 700 

Asp Leu Tyr Asp Pro Arg Asp Lys Ala Lys Leu Leu Tyr Asn Asn Leu 
705 710 715 720 

Asp Ala Phe Gly He Met Asp Tyr Thr Leu Thr Gly Lys Val Glu Asp 

725 730 735 

Asn His Asp Asp Thr Asn Arg He He Thr Val Tyr Met Gly Lys Aro 

740 745 750 

Pro Glu Gly Glu Asn Ala Ser Tyr His Leu Ala Tyr Asp Lys Asp Arg 
755 760 765 

Tyr Thr Glu Glu Glu Arg Glu Val Tyr Ser Tyr Leu Arg Tyr Thr Gly 
770 775 780 

Thr Pro He Pro Asp Asn Pro Asn Asp Lys Asn Asn Ser Gin Leu Val 
785 790 795 800 

Val Ser Val Ala Gly Thr Val Glu Gly Thr Asn Gin Asp He Ser Leu 

805 810 815 

Lys Phe Phe Glu He Asp Leu Thr Ser Arg Pro Ala His Gly Gly Lys 

820 825 830 

Thr Glu Gin Gly Leu Ser Pro Lys Ser Lys Pro Phe Ala Thr Asp Ser 
835 840 845 

Gly Ala Met Ser His Lys Leu Glu Lys Ala Asp Leu Leu Lys Ala He 
850 855 860 

Gin Glu Gin Leu He Ala Asn Val His Ser Asn Asp Asp Tyr Phe Glu 

870 875 880 

Val He Asp Phe Ala Ser Asp Ala Thr He Thr Asp Arg Asn Gly Lys 

885 890 895 

Val Tyr Phe Ala Asp Lys Asp Gly Ser Val Thr Leu Pro Thr Gin Pro 

900 905 910 

Val Gin Glu Phe Leu Leu Ser Gly His Val Arg Val Arg Tyr Lys Glu 
915 920 925 

Lys Pro He Gin Asn Gin Ala Lys Ser Val Asp Val Glu Tyr Thr Val 
930 935 940 

Gin Phe Thr Pro Leu Asn Pro Asp Asp Asp Phe Arg Pro Gly Leu Lvs 
945 950 955 960 

Asp Thr Lys Leu Leu Lys Thr Leu Ala He Gly Asp Thr He Thr Ser 

965 970 975 

Gin Glu Leu Leu Ala Gin Ala Gin Ser He Leu Asn Lys Asn His Pro 

980 985 990 

Gly Tyr Thr He Tyr Glu Arg Asp Ser Ser He Val Thr His Asp Asn 
995 1000 1005 
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Asp lie Phe Arg Thr He Leu Pro Met Asp Gin Glu Phe Thr Tyr Arg 
1010 1015 1020 

Val Lys Ash Arg Glu Gin Ala Tyr Arg He Asn Lys Lys Ser 61y Leu 
1025 1030 1035 1040 

Asn Glu Glu He Asn Asn Thr Asp Leu He Ser Glu Lys Tyr Tyr Val 

1045 1050 1055 

Leu Lys Lys Gly Glu Lys Pro Tyr Asp Pro Phe Asp Arg Ser His Leu 

1060 1065 1070 

Lys Leu Phe Thr He Lys Tyr Val Asp Val Asp Thr Asn Glu Leu Leu 
1075 1080 1085 

Lys Ser Glu Gin Leu Leu Thr Ala Ser Glu Arg Asn Leu Asp Phe Arg 
1090 1095 1100 

Asp Leu Tyr Asp Pro Arg Asp Lys Ala Lys Leu Leu Tyr Asn Asn Leu 
1105 1110 1115 1120 

Asp Ala Phe Gly He Met Asp Tyr Thr Leu Thr Gly Lys Val Glu Asp 

1125 1130 1135 

Asn His Asp Asp Thr Asn Arg He He Thr Val Tyr Met Gly Lys Arg 

1140 1145 1150 

Pro Glu Gly Glu Asn Ala Ser Tyr His Leu Ala Tyr Asp Lys Asp Arg 
1155 1160 1165 

Tyr Thr Glu Glu Glu Arg Glu Val Tyr Ser Tyr Leu Arg Tyr Thr Gly 
1170 1175 1180 

Thr Pro He Pro Asp Asn Pro Asn Asp Lys 
1185 1190 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1181 amino acids 

(B) TYPE: amino acid 

(C) STRANDEONESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Lys Thr Glu Glu Gly Lys Leu Val He Trp He Asn Gly Asp Lys 
15 10 15 

Gly Tyr Asn Gly Leu Ala Glu Val Gly Lys Lys Phe Glu Lys Asp Thr 

20 25 30 

Gly He Lys Val Thr Val Glu His Pro Asp Lys Leu Glu Glu Lys Phe 
35 40 45 

Pro Gin Val Ala Ala Thr Gly Asp Gly Pro Asp He He Phe Trp Ala 
50 55 60 
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His Asp Arg Phe Gly Gly Tyr Ala Gin Ser Gly Leu Leu Ala Glu lie 
65 70 75 80 

Thr Pro Asp Lys Ala Phe Gin Asp Lys Leu Tyr Pro Phe Thr Tro Asp 

85 90 95 

Ala Val Arg Tyr Asn Gly Lys Leu He Ala Tyr Pro He Ala Val Glu 

100 105 110 

Ala Leu Ser Leu He Tyr Asn Lys Asp Leu Leu Pro Asn Pro Pro Lys 
115 120 125 

Thr Trp Glu Glu He Pro Ala Leu Asp Lys Glu Leu Lys Ala Lvs Glv 
130 135 140 Jr jr 

Lys Ser Ala Leu Met Phe Asn Leu Gin Glu Pro Tyr Phe Thr Tro Pro 
145 150 155 160 

Leu He Ala Ala Asp Gly Gly Tyr Ala Phe Lys Tyr Glu Asn Gly Lys 

165 170 175 

Tyr Asp He Lys Asp Val Gly Val Asp Asn Ala Gly Ala Lys Ala Gly 

180 185 190 

Leu Thr Phe Leu Val Asp Leu He Lys Asn Lys His Met Asn Ala Asp 
195 200 205 

Thr Asp Tyr Ser He Ala Glu Ala Ala Phe Asn Lys Gly Glu Thr Ala 
210 215 220 

Met Thr He Asn Gly Pro Trp Ala Trp Ser Asn He Asp Thr Ser Lys 
225 230 235 240 

Val Asn Tyr Gly Val Thr Val Leu Pro Thr Phe Lys Gly Gin Pro Ser 

245 250 255 

Lys Pro Phe Val Gly Val Leu Ser Ala Gly He Asn Ala Ala Ser Pro 

260 265 270 

Asn Lys Glu Leu Ala Lys Glu Phe Leu Glu Asn Tyr Leu Leu Thr Asp 
275 280 285 

Glu Gly Leu Glu Ala Val Asn Lys Asp Lys Pro Leu Gly Ala Val Ala 
290 295 300 

Leu Lys Ser Tyr Glu Glu Glu Leu Ala Lys Asp Pro Arg He Ala Ala 
305 310 315 320 

Thr Met Glu Asn Ala Gin Lys Gly Glu He Met Pro Asn He Pro Gin 

325 330 335 

Met Ser Ala Phe Trp Tyr Ala Val Arg Thr Ala Val He Asn Ala Ala 

340 345 350 

Ser Gly Arg Gin Thr Val Asp Glu Ala Leu Lys Asp Ala Gin Thr Asn 
355 360 365 

ser Ser Ser Val Pro Gly Arg Gly Ser He Glu Gly Arg Asn Asn Ser 
370 375 380 

Gin Leu Val Val Ser Val Ala Gly Thr Val Glu Gly Thr Asn Gin Asp 
385 390 395 400 
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He Ser Leu Lys Phe Phe Glu He Asp Leu Thr Ser Arg Pro Ala His 

405 410 

Gly Gly Lys Thr Glu Gin Gly Leu Ser Pro Lys Ser Lys Pro Phe Ala 

420 425 430 

Thr Asp Ser Gly Ala Met Ser His Lys Leu Glu Lys Ala Asp Leu Leu 
435 440 445 

Lys Ala He Gin Glu Gin Leu He Ala Asn Val His Ser Asn Asp Asp 
450 455 ^ 

Tyr Phe Glu Val He Asp Phe Ala Ser Asp Ala Thr He Thr Asp Arg 

470 475 

Asn Gly Lys Val Tyr Phe Ala Asp Lys Asp Gly Ser Val Thr Leu Pro 

485 490 495 

Thr Gin Pro Val Gin Glu Phe Leu Leu Ser Gly His Val Arg Val Arg 

500 505 510 

Tyr Lys Glu Lys Pro He Gin Asn Gin Ala Lys Ser Val Asp Val Glu 
515 520 525 

Tyr Thr Val Gin Phe Thr Pro Leu Asn Pro Asp Asp Asp Phe Arg Pro 

535 540 

Gly Leu Lys Asp Thr Lys Leu Leu Lys Thr Leu Ala He Gly Asp Thr 

550 555 560 

He Thr Ser Gin Glu Leu Leu Ala Gin Ala Gin Ser He Leu Asn Lvs 

565 570 575 

Asn His Pro Gly Tyr Thr He Tyr Glu Arg Asp Ser Ser He Val Thr 

580 585 590 

His Asp Asn Asp He Phe Arg Thr He Leu Pro Met Asp Gin Glu Phe 
595 600 605 

Thr Tyr Arg Val Lys Asn Arg Glu Gin Ala Tyr Arg He Asn Lys Lys 
^-"•O 615 620 

Ser Gly Leu Asn Glu Glu He Asn Asn Thr Asp Leu He Ser Glu Lys 

630 635 640 

Tyr Tyr Val Leu Lys Lys Gly Glu Lys Pro Tyr Asp Pro Phe Asp Aro 

^45 650 655 

Ser His Leu Lys Leu Phe Thr He Lys Tyr Val Asp Val Asp Thr Asn 

550 665 670 

Glu Leu Leu Lys Ser Glu Gin Leu Leu Thr Ala Ser Glu Arg Asn Leu 
575 680 685 

Asp Phe Arg Asp Leu Tyr Asp Pro Arg Asp Lys Ala Lys Leu Leu Tyr 

695 700 

Asn Asn Leu Asp Ala Phe Gly He Met Asp Tyr Thr Leu Thr Gly Lys 
'"S 710 715 720 

Val Glu Asp Asn His Asp Asp Thr Asn Arg He He Thr Val Tyr Met 

725 730 735 
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Gly Lys Arg Pro Glu Gly Glu Asn Ala Ser Tyr His Leu Ala Tyr Asp 

740 745 750 

Lys Asp Arg Tyr Thr Glu Glu Glu Arg Glu Val Tyr Ser Tyr Leu Ara 
755 760 765 

Tyr Thr Gly Thr Pro He Pro Asp Asn Pro Asn Asp Lys Asn Asn Ser 
770 775 780 

Gin Leu Val Val Ser Val Ala Gly Thr Val Glu Gly Thr Asn Gin Ast> 
7B5 790 795 800 

He Ser Leu Lys Phe Phe Glu He Asp Leu Thr Ser Arg Pro Ala His 

805 810 815 

Gly Gly Lys Thr Glu Gin Gly Leu Ser Pro Lys Ser Lys Pro Phe Ala 

820 825 830 

Thr Asp Ser Gly Ala Met Ser His Lys Leu Glu Lys Ala Asp Leu Leu 
835 840 845 

Lys Ala He Gin Glu Gin Leu He Ala Asn Val His Ser Asn Asp Asp 
850 855 860 

Tyr Phe Glu Val He Asp Phe Ala Ser Asp Ala Thr He Thr Asp Ara 
865 870 875 880 

Asn Gly Lys Val Tyr Phe Ala Asp Lys Asp Gly Ser Val Thr Leu Pro 

885 890 895 

Thr Gin Pro Val Gin Glu Phe Leu Leu Ser Gly His Val Arg Val Arg 

900 905 910 

Tyr Lys Glu Lys Pro He Gin Asn Gin Ala Lys Ser Val Asp Val Glu 
915 920 925 

Tyr Thr Val Gin Phe Thr Pro Leu Asn Pro Asp Asp Asp Phe Ara Pro 
930 935 940 

Gly Leu Lys Asp Thr Lys Leu Leu Lys Thr Leu Ala He Gly Asp Thr 
945 950 955 960 

He Thr Ser Gin Glu Leu Leu Ala Gin Ala Gin Ser He Leu Asn Lys 

965 970 975 

Asn His Pro Gly Tyr Thr He Tyr Glu Arg Asp Ser Ser He Val Thr 

980 985 990 

His Asp Asn Asp He Phe Arg Thr He Leu Pro Met Asp Gin Glu Phe 
995 1000 1005 

Thr Tyr Arg Val Lys Asn Arg Glu Gin Ala Tyr Arg He Asn Lys Lys 
1010 1015 1020 

Ser Gly Leu Asn Glu Glu He Asn Asn Thr Asp Leu He Ser Glu Lys 
1025 1030 1035 1040 

Tyr Tyr Val Leu Lys Lys Gly Glu Lys Pro Tyr Asp Pro Phe Asp Arg 

1045 1050 1055 

Ser His Leu Lys Leu Phe Thr He Lys Tyr Val Asp Val Asp Thr Asn 

1060 1065 1070 
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Glu Leu Leu Lys Ser Glu Gin Leu Leu Thr Ala Ser Glu Arg Asn Leu 
1075 1080 1085 

Asp Phe Arg Asp Leu Tyr Asp Pro Arg Asp Lys Ala Lys Leu Leu Tyr 
1090 1095 1100 

Asn Asn Leu Asp Ala Phe Gly lie Met Asp Tyr Thr Leu Thr Gly Lys 
1105 1110 1115 1120 

Val Glu Asp Asn His Asp Asp Thr Asn Arg lie lie Thr Val Tyr Met 

1125 1130 1135 

Gly Lys Arg Pro Glu Gly Glu Asn Ala Ser Tyr His Leu Ala Tyr Asp 

1140 1145 1150 

Lys Asp Arg Tyr Thr Glu Glu Glu Arg Glu Val Tyr Ser Tyr Leu Arg 
1155 1160 1165 

Tyr Thr Gly Thr Pro lie Pro Asp Asn Pro Asn Asp Lys 
1170 1175 1180 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

lie Ala Gly Pro Glu Trp Leu Leu Asp Arg Pro Ser Val Asn Asn Ser 
1 5 10 15 

Gin Leu Val Val Ser Val Ala Gly Thr Val Glu Gly Thr Asn Gin Asp 

20 25 30 

lie Ser Leu Lys Phe Phe Glu lie Asp Leu Thr Ser Arg Pro Ala His 
35 40 45 

Gly Gly Lys Thr Glu Gin Gly Leu Ser Pro Lys Ser Lys Pro Phe Ala 
50 55 60 

Thr Asp Ser Gly Ala Met Ser His Lys Leu Glu Lys Ala Asp Leu Leu 
^5 70 75 80 

Lys Ala lie Gin Glu Gin Leu lie Ala Asn Val His Ser Asn Asp Asp 

85 90 95 

Tyr Phe Glu Val He Asp Phe Ala Ser Asp Ala Thr He Thr Asp Arg 

100 105 110 

Asn Gly Lys Val Tyr Phe Ala Asp Lys Asp Gly Ser Val Thr Leu Pro 
115 120 125 

Thr Gin Pro Val Gin Glu Phe Leu Leu Ser Gly His Val Arg Val Ara 
130 135 140 
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Tyr Lys Glu Lys Pro He Gin Asn Gin Ala Lys Ser Val Asp Val Glu 
145 150 155 160 

Tyr Thr Val Gin Phe Thr Pro Leu Asn Pro Asp Asp Asp Phe Arg Pro 

165 170 175 

Gly Leu Lys Asp Thr Lys Leu Leu Lys Thr Leu Ala He Gly Asp Thr 

180 185 190 

He Thr Ser Gin Glu Leu Leu Ala Gin Ala Gin Ser He Leu Asn Lvs 
195 200 205 

Asn His Pro Gly Tyr Thr He Tyr Glu Arg Asp Ser Ser He Val Thr 
210 215 220 

His Asp Asn Asp He Phe Arg Thr He Leu Pro Met Asp Gin Glu Phe 
225 230 235 240 

Thr Tyr Arg Val Lys Asn Arg Glu Gin Ala Tyr Arg He Asn Lys Lys 

245 250 255 

Ser Gly Leu Asn Glu Glu He Asn Asn Thr Asp Leu He Ser Glu Lys 

260 265 270 

Tyr Tyr Val Leu Lys Lys Gly Glu Lys Pro Tyr Asp Pro Phe Asp Aro 
275 280 285 *- ^ 

Ser His Leu Lys Leu Phe Thr He Lys Tyr Val Asp Val Asp Thr Asn 
290 295 300 

Glu Leu Leu Lys Ser Glu Gin Leu Leu Thr Ala Ser Glu Arg Asn Leu 
305 310 315 320 

Asp Phe Arg Asp Leu Tyr Asp Pro Arg Asp Lys Ala Lys Leu Leu Tyr 

325 330 335 

Asn Asn Leu Asp Ala Phe Gly He Met Asp Tyr Thr Leu Thr Gly Lys 

340 345 350 

Val Glu Asp Asn His Asp Asp Thr Asn Arg He He Thr Val Tyr Met 
355 360 365 

Gly Lys Arg Pro Glu Gly Glu Asn Ala Ser Tyr His Leu Ala Tyr Asp 
370 375 380 

Lys Asp Arg Tyr Thr Glu Glu Glu Arg Glu Val Tyr Ser Tyr Leu Ara 
385 390 395 400 

Tyr Thr Gly Thr Pro He Pro Asp Asn Pro Asn Asp Lys Asn Asn Ser 

405 410 415 

Gin Leu Val Val Ser Val Ala Gly Thr Val Glu Gly Thr Asn Gin Asp 

420 425 430 

He Ser Leu Lys Phe Phe Glu He Asp Leu Thr Ser Arg Pro Ala His 
435 440 

Gly Gly Lys Thr Glu Gin Gly Leu Ser Pro Lys Ser Lys Pro Phe Ala 
450 455 460 

Thr Asp Ser Gly Ala Met Ser His Lys Leu Glu Lys Ala Asp Leu Leu 
465 470 475 480 
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Lys Ala lie Gin Glu Gin Leu He Ala Asn Val His Ser Asn Asp Asp 

485 490 495 

Tyr Phe Glu Val He Asp Phe Ala Ser Asp Ala Thr He Thr Asp Arg 

500 505 510 

Asn Gly Lys Val Tyr Phe Ala Asp Lys Asp Gly Ser Val Thr Leu Pro 
515 520 525 

Thr Gin Pro Val Gin Glu Phe Leu Leu Ser Gly His Val Arg Val Arg 
530 535 540 

Tyr Lys Glu Lys Pro He Gin Asn Gin Ala Lys Ser Val Asp Val Glu 
545 550 555 560 

Tyr Thr Val Gin Phe Thr Pro Leu Asn Pro Asp Asp Asp Phe Arg Pro 

565 570 575 

Gly Leu Lys Asp Thr Lys Leu Leu Lys Thr Leu Ala He Gly Asp Thr 

580 585 590 

He Thr Ser Gin Glu Leu Leu Ala Gin Ala Gin Ser He Leu Asn Lys 
595 600 605 

Asn His Pro Gly Tyr Thr He Tyr Glu Arg Asp Ser Ser He Val Thr 
610 615 620 

His Asp Asn Asp He Phe Arg Thr He Leu Pro Met Asp Gin Glu Phe 
625 630 635 640 

Thr Tyr Arg Val Lys Asn Arg Glu Gin Ala Tyr Arg He Asn Lys Lys 

645 650 655 

Ser Gly Leu Asn Glu Glu He Asn Asn Thr Asp Leu He Ser Glu Lys 

660 665 670 

Tyr Tyr Val Leu Lys Lys Gly Glu Lys Pro Tyr Asp Pro Phe Asp Ara 
675 680 685 

Ser His Leu Lys Leu Phe Thr He Lys Tyr Val Asp Val Asp Thr Asn 
690 695 700 

Glu Leu Leu Lys Ser Glu Gin Leu Leu Thr Ala Ser Glu Arg Asn Leu 
705 710 715 720 

Asp Phe Arg Asp Leu Tyr Asp Pro Arg Asp Lys Ala Lys Leu Leu Tyr 

725 730 735 

Asn Asn Leu Asp Ala Phe Gly He Met Asp Tyr Thr Leu Thr Gly Lys 

740 745 750 

Val Glu Asp Asn His Asp Asp Thr Asn Arg He He Thr Val Tyr Met 
755 760 765 

Gly Lys Arg Pro Glu Gly Glu Asn Ala Ser Tyr His Leu Ala Tyr Asp 
770 775 780 

Lys Asp Arg Tyr Thr Glu Glu Glu Arg Glu Val Tyr Ser Tyr Leu Aro 
785 790 795 800 

Tyr Thr Gly Thr Pro He Pro Asp Asn Pro Asn Asp Lys 

805 810 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Asn Asn Ser Gin Leu Val Val Ser Val Ala Gly Thr Val Glu Gly Thr 
15 10 15 

Asn Gin Asp lie Ser Leu Lys Phe Phe Glu lie Asp Leu Thr Ser Arg 

20 25 30 

Pro Ala His Gly Gly Lys Thr Glu Gin Gly Leu Ser Pro Lys Ser Lys 
35 40 45 

Pro Phe Ala Thr Asp Ser Gly Ala Met Ser His Lys Leu Glu Lys Ala 
50 55 60 

Asp Leu Leu Lys Ala He Gin Glu Gin Leu He Ala Asn Val His Ser 
65 70 75 80 

Asn Asp Asp Tyr Phe Glu Val He Asp Phe Ala Ser Asp Ala Thr He 

85 90 95 

Thr Asp Arg Asn Gly Lys Val Tyr Phe Ala Asp Lys Asp Gly Ser Val 

100 105 110 

Thr Leu Pro Thr Gin Pro Val Gin Glu Phe Leu Leu Ser Gly His Val 
115 120 125 

Arg Val Arg Tyr Lys Glu Lys Pro He Gin Asn Gin Ala Lys Ser Val 
130 135 140 

Asp Val Glu Tyr Thr Val Gin Phe Thr Pro Leu Asn Pro Asp Asp Asp 
145 150 155 160 

Phe Arg Pro Gly Leu Lys Asp Thr Lys Leu Leu Lys Thr Leu Ala He 

165 170 175 

Gly Asp Thr He Thr Ser Gin Glu Leu Leu Ala Gin Ala Gin Ser He 

180 185 190 

Leu Asn Lys Asn His Pro Gly Tyr Thr He Tyr Glu Arg Asp Ser Ser 
195 200 205 

He Val Thr His Asp Asn Asp He Phe Arg Thr He Leu Pro Met Asp 
210 215 220 

Gin Glu Phe Thr Tyr Arg Val Lys Asn Arg Glu Gin Ala Tyr Arg He 
225 230 235 240 

Asn Lys Lys Ser Gly Leu Asn Glu Glu He Asn Asn Thr Asp Leu He 

245 250 255 
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Ser Glu Lys Tyr Tyr Val Leu Lys Lys Gly Glu Lys Pro Tyr Asp Pro 

260 265 270 

Phe Asp Arg Ser His Leu Lys Leu Phe Thr He Lys Tyr Val Asp Val 
275 280 285 

Asp Thr Asn Glu Leu Leu Lys Ser Glu Gin Leu Leu Thr Ala Ser Glu 
290 295 300 

Arg Asn Leu Asp Phe Arg Asp Leu Tyr Asp Pro Arg Asp Lys Ala Lys 
305 310 315 320 

Leu Leu Tyr Asn Asn Leu Asp Ala Phe Gly He Met Asp Tyr Thr Leu 

325 330 335 

Thr Gly Lys Val Glu Asp Asn His Asp Asp Thr Asn Arg He He Thr 

340 345 350 

Val Tyr Met Gly Lys Arg Pro Glu Gly Glu Asn Ala Ser Tyr His Leu 
355 360 365 

Ala Tyr Asp Lys Asp Arg Tyr Thr Glu Glu Glu Arg Glu Val Tyr Ser 
370 375 380 

Tyr Leu Arg Tyr Thr Gly Thr Pro He Pro Asp Asn Pro Asn Asp Lys 
385 390 395 400 

Asn Asn Ser Gin Leu Val Val Ser Val Ala Gly Thr Val Glu Gly Thr 

405 410 415 

Asn Gin Asp He Ser Leu Lys Phe Phe Glu He Asp Leu Thr Ser Aro 

420 425 430 

Pro Ala His Gly Gly Lys Thr Glu Gin Gly Leu Ser Pro Lys Ser Lvs 
435 440 445 ' 

Pro Phe Ala Thr Asp Ser Gly Ala Met Ser His Lys Leu Glu Lys Ala 
450 455 460 

Asp Leu Leu Lys Ala He Gin Glu Gin Leu He Ala Asn Val His Ser 
465 470 475 480 

Asn Asp Asp Tyr Phe Glu Val He Asp Phe Ala Ser Asp Ala Thr He 

485 490 495 

Thr Asp Arg Asn Gly Lys Val Tyr Phe Ala Asp Lys Asp Gly Ser Val 

500 505 510 

Thr Leu Pro Thr Gin Pro Val Gin Glu Phe Leu Leu Ser Glv His Val 
515 520 525 

Arg Val Arg Tyr Lys Glu Lys Pro He Gin Asn Gin Ala Lys Ser Val 
530 535 540 

Asp Val Glu Tyr Thr Val Gin Phe Thr Pro Leu Asn Pro Asp Asp Asp 
545 550 555 ^ 56O 

Phe Arg Pro Gly Leu Lys Asp Thr Lys Leu Leu Lys Thr Leu Ala He 

565 570 575 

Gly Asp Thr He Thr Ser Gin Glu Leu Leu Ala Gin Ala Gin Ser He 

580 585 590 
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Leu Asn Lys Asn His Pro Gly Tyr Thr lie Tyr Glu Arg Asp Ser Ser 
595 600 605 

lie Val Thr His Asp Asn Asp lie Phe Arg Thr lie Leu Pro Met Asp 
610 615 620 

Gin Glu Phe Thr Tyr Arg Val Lys Asn Arg Glu Gin Ala Tyr Arg lie 
625 630 635 640 

Asn Lys Lys Ser Gly Leu Asn Glu Glu lie Asn Asn Thr Asp Leu lie 

645 650 655 

Ser Glu Lys Tyr Tyr Val Leu Lys Lys Gly Glu Lys Pro Tyr Asp Pro 

660 665 670 

Phe Asp Arg Ser His Leu Lys Leu Phe Thr lie Lys Tyr Val Asp Val 
675 680 685 

Asp Thr Asn Glu Leu Leu Lys Ser Glu Gin Leu Leu Thr Ala Ser Glu 
690 695 700 

Arg Asn Leu Asp Phe Arg Asp Leu Tyr Asp Pro Arg Asp Lys Ala Lvs 
705 710 715 720 

Leu Leu Tyr Asn Asn Leu Asp Ala Phe Gly lie Met Asp Tyr Thr Leu 

725 730 735 

Thr Gly Lys Val Glu Asp Asn His Asp Asp Thr Asn Arg lie He Thr 

740 745 750 

Val Tyr Met Gly Lys Arg Pro Glu Gly Glu Asn Ala Ser Tyr His Leu 
755 760 765 

Ala Tyr Asp Lys Asp Arg Tyr Thr Glu Glu Glu Arg Glu Val Tyr Ser 
770 775 780 

Tyr Leu Arg Tyr Thr Gly Thr Pro He Pro Asp Asn Pro Asn Asp Lys 
785 790 795 800 

(2) INFORMATION FOR SEQ ID NOsS: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STR21NDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GCTGCTAGAC GCGCCATCTG TCAAC 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TGGCGCGTCT AGCAGCCACT CAG 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CAAGACATTA GTCTGGCCTT TTTTGAAATC 6 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GGCCAGACTA ATGTCTTGAT TCG 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CGATCTAACA TCGGCGCCTG CTCATGG 
(2) INFORMATION FOR SEQ ID NO: 10: 



27 
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CGATCTAACA TCGGCGCCTG CTCATGG 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
CGCCGATGTT AGATC6ATTT C 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
GCTCATGGAG GCGCCACAGA GGGC 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
GGCGCCTCCA TGAGCAGGTC 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 5 
GCTTAAGTCC GGCCTCAAAA CCATTTGC 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TGAGGCCGGA CTTAAGCCTT GCTC 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GCCGATOGAT ATACCGAAGA AGAACGAG 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TATCGATCGG CATCATAGGC TAAATGATAG C 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1194 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

* 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Lys Thr Glu Glu Gly Lys Leu Val lie Trp lie Asn Gly Asp Lys 
^ S 10 15 

Gly Tyr Asn Gly Leu Ala Glu Val Gly Lys Lys Phe Glu Lys Asp Thr 

20 25 30 

Gly He Lys Val Thr Val Glu His Px^o Asp Lys Leu Glu Glu Lys Phe 
35 40 45 

Pro Gin Val Ala Ala Thr Gly Asp Gly Pro Asp He He Phe Trp Ala 
50 55 60 

His Asp Arg Phe Gly Gly Tyr Ala Gin Ser Gly Leu Leu Ala Glu He 
^5 70 75 80 

Thr Pro Asp Lys Ala Phe Gin Asp Lys Leu Tyr Pro Phe Thr Trp Asp 

85 90 95 

Ala Val Arg Tyr Asn Gly Lys Leu He Ala Tyr Pro He Ala Val Glu 

100 105 110 

Ala Leu Ser Leu He Tyr Asn Lys Asp Leu Leu Pro Asn Pro Pro Lvs 
115 120 125 

Thr Trp Glu Glu He Pro Ala Leu Asp Lys Glu Leu Lys Ala Lys Glv 
130 135 140 jr 

Lys Ser Ala Leu Met Phe Asn Leu Gin Glu Pro Tyr Phe Thr Tro Pro 
145 150 155 ^ 160 

Leu He Ala Ala Asp Gly Gly Tyr Ala Phe Lys Tyr Glu Asn Gly Lys 

165 170 175 

Tyr Asp He Lys Asp Val Gly Val Asp Asn Ala Gly Ala Lys Ala Gly 

180 185 190 

Leu Thr Phe Leu Val Asp Leu He Lys Asn Lys His Met Asn Ala Asp 
195 200 205 

Thr Asp Tyr Ser He Ala Glu Ala Ala Phe Asn Lys Gly Glu Thr Ala 
210 215 220 

Met Thr He Asn Gly Pro Trp Ala Trp Ser Asn He Asp Thr Ser Lvs 
225 230 235 240 

Val Asn Tyr Gly Val Thr Val Leu Pro Thr Phe Lys Gly Gin Pro Ser 

245 250 255 

Lys Pro Phe Val Gly Val Leu Ser Ala Gly He Asn Ala Ala Ser Pro 

260 265 270 

Asn Lys Glu Leu Ala Lys Glu Phe Leu Glu Asn Tyr Leu Leu Thr Asp 
275 280 285 



wo 96/41883 



PCTAJS96/09640 



Glu Gly Leu Glu 
290 



Leu Lye Ser Tyr 
305 

Thr Met Glu Asn 



Met Ser Ala Phe 

340 

Ser Gly Arg Gin 
355 

Ser Ser Ser Val 
370 

Pro Glu Trp Leu 
385 

Val Ser Val Ala 



Ala Phe Phe Glu 

420 

Thr Glu Gin Gly 
435 

Gly Ala Met Ser 
450 

Gin Glu Gin Leu 
465 

Val lie Asp Phe 



Val Tyr Phe Ala 

500 

Val Gin Glu Phe 
515 

Lys Pro lie Gin 
530 

Gin Phe Thr Pro 
545 

Asp Thr Lys Leu 



Gin Glu Leu Leu 

580 

Gly Tyr Thr lie 
595 

Asp Xle Phe Arg 
610 
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Ala Val Asn Lys 
295 

Glu Glu Glu Leu 
310 

Ala Gin Lys Gly 
325 

Trp Tyr Ala Val 



Thr Val Asp Glu 

360 

Pro Gly Arg Gly 
375 

Leu Asp Ala Pro 
390 

Gly Thr Val Glu 
405 

lie Asp Leu Thr 



Leu Ser Pro Ala 

440 

His Lys Leu Glu 
455 

lie Ala Asn Val 
470 

Ala Ser Asp Ala 
485 

Asp Lys Asp Gly 



Leu Leu Ser Gly 

520 

Asn Gin Ala Lys 
535 

Leu Asn Pro Asp 
550 

Leu Lys Thr Leu 
565 

Ala Gin Ala Gin 



Tyr Glu Arg Asp 

600 

Thr lie Leu Pro 
615 



Asp Lys Pro Leu 

300 

Ala Lys Asp Pro 
315 

Glu He Met Pro 
330 

Arg Thr Ala Val 
345 

Ala Leu Lys Asp 



Ser He Glu Gly 

380 

Ser Val Asn Asn 
395 

Gly Thr Asn Gin 
410 

Ser Ala Pro Ala 
425 



Ser Lys Pro Phe 



Lys Ala Asp Leu 

460 

His Ser Asn Asp 
475 

Thr He Thr Asp 
490 

Ser Val Thr Leu 
505 

His Val Arg Val 



Ser Val Asp Val 

540 

Asp Asp Phe Arg 
555 

Ala He Gly Asp 
570 

Ser He Leu Asn 
585 

Ser Ser He Val 



Met Asp Gin Glu 

620 



Gly Ala Val Ala 



Arg He Ala Ala 

320 

Asn He Pro Gin 
335 

He Asn Ala Ala 
350 

Ala Gin Thr Asn 
365 

Arg He Ala Gly 



Ser Gin Leu Val 

400 

Asp He Ser Leu 
415 

His Gly Gly Ala 
430 

Ala Thr Asp Ser 
445 

Leu Lys Ala He 



Asp Tyr Phe Glu 

480 

Arg Asn Gly Lys 
495 

Pro Thr Gin Pro 
510 

Arg Tyr Lys Glu 
525 

Glu Tyr Thr Val 



Pro Gly Leu Lys 

560 

Thr He Thr Ser 
575 

Lys Asn His Pro 
590 

Thr His Asp Asn 
605 

Phe Thr Tyr Arg 
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Val Lys Asn Arg Glu Gin Ala Tyr Arg He Asn Lys Lys Ser Gly Leu 
"5 630 635 640 

Asn Glu Glu He Asn Asn Thr Asp Leu He Ser Glu Lys Tyr Tyr Val 

M5 650 655 

Leu Lys Lys Gly Glu Lys Pro Tyr Asp Pro Phe Asp Arg Ser His Leu 

660 665 670 

Lys Leu Phe Thr He Lys Tyr Val Asp Val Asp Thr Asn Glu Leu Leu 
675 680 685 

Lys Ser Glu Gin Leu Leu Thr Ala Ser Glu Arg Asn Leu Asp Phe Ara 
690 695 700 

Asp Leu Tyr Asp Pro Arg Asp Lys Ala Lys Leu Leu Tyr Asn Asn Leu 
705 710 715 720 

Asp Ala Phe Gly He Met Asp Tyr Thr Leu Thr Gly Lys Val Glu Asp 

725 730 735 

Asn His Asp Asp Thr Asn Arg He He Thr Val Tyr Met Gly Lys Arg 

740 745 750 

Pro Glu Gly Glu Asn Ala Ser Tyr His Leu Ala Tyr Asp Lys Asp Arg 
755 760 765 

Tyr Thr Glu Glu Glu Arg Glu Val Tyr Ser Tyr Leu Arg Tyr Thr Glv 
770 775 780 

Thr Pro He Pro Asp Asn Pro Asn Asp Lys Asn Asn Ser Gin Leu Val 
785 790 795 800 

Val Ser Val Ala Gly Thr Val Glu Gly Thr Asn Gin Asp He Ser Leu 

805 810 815 

Lys Phe Phe Glu He Asp Leu Thr Ser Arg Pro Ala His Gly Gly Lys 

820 825 830 

Thr Glu Gin Gly Leu Ser Pro Lys Ser Lys Pro Phe Ala Thr Asp Ser 
835 840 845 

Gly Ala Met Ser His Lys Leu Glu Lys Ala Asp Leu Leu Lys Ala He 
850 855 860 

Gin Glu Gin Leu He Ala Asn Val His Ser Asn Asp Asp Tyr Phe Glu 
865 870 875 880 

Val He Asp Phe Ala Ser Asp Ala Thr He Thr Asp Arg Asn Gly Lys 

885 890 895 

Val Tyr Phe Ala Asp Lys Asp Gly Ser Val Thr Leu Pro Thr Gin Pro 

900 905 910 

Val Gin Glu Phe Leu Leu Ser Gly His Val Arg Val Arg Tyr Lys Glu 
915 920 925 

Lys Pro He Gin Asn Gin Ala Lys Ser Val Asp Val Glu Tyr Thr Val 
930 935 940 

Gin Phe Thr Pro Leu Asn Pro Asp Asp Asp Phe Arg Pro Gly Leu Lvs 
945 950 955 960 



wo 96/41883 PCTAJS96/09640 



- 44 - 

Asp Thr Lys Leu Leu Lys Thr Leu Ala He Gly Asp Thr He Thr Ser 

965 970 975 

Gin Glu Leu Leu Ala Gin Ala Gin Ser He Leu Asn Lys Asn His Pro 

980 985 990 

Gly Tyr Thr He Tyr Glu Arg Asp Ser Ser He Val Thr His Asp Asn 
995 1000 1005 

Asp He Phe Arg Thr He Leu Pro Met Asp Gin Glu Phe Thr Tyr Aro 
1010 1015 1020 

Val Lys Asn Arg Glu Gin Ala Tyr Arg He Asn Lys Lys Ser Glv Leu 
1025 1030 1035 1040 

Asn Glu Glu He Asn Asn Thr Asp Leu He Ser Glu Lys Tyr Tyr Val 

1045 1050 1055 

Leu Lys Lys Gly Glu Lys Pro Tyr Asp Pro Phe Asp Arg Ser His Leu 

1060 1065 1070 

Lys Leu Phe Thr He Lys Tyr Val Asp Val Asp Thr Asn Glu Leu Leu 
1075 1080 1085 

Lys Ser Glu Gin Leu Leu Thr Ala Ser Glu Arg Asn Leu Asp Phe Ara 
1090 1095 1100 

Asp Leu Tyr Asp Pro Arg Asp Lys Ala Lys Leu Leu Tyr Asn Asn Leu 
1105 1110 1115 1120 

Asp Ala Phe Gly He Met Asp Tyr Thr Leu Thr Gly Lys Val Glu Asp 

1125 1130 1135 

Asn His Asp Asp Thr Asn Arg He He Thr Val Tyr Met Gly Lys Arg 

1140 1145 1150 

Pro Glu Gly Glu Asn Ala Ser Tyr His Leu Ala Tyr Asp Lys Asp Ara 
1155 1160 1165 

Tyr Thr Glu Glu Glu Arg Glu Val Tyr Ser Tyr Leu Arg Tyr Thr Gly 
1170 1175 1180 

Thr Pro He Pro Asp Asn Pro Asn Asp Lys 
1185 1190 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1194 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Lys Thr Glu Glu Gly Lys Leu Val He Trp He Asn Gly Asp Lys 
15 10 15 
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Gly Tyr Asn Gly Leu Ala Glu Val Gly Lys Lys Phe Glu Lys Asp Thr 

20 25 30 

Gly lie Lys Val Thr Val Glu His Pro Asp Lys Leu Glu Glu Lys Phe 
35 40 45 

Pro Gin Val Ala Ala Thr Gly Asp Gly Pro Asp lie lie Phe Trp Ala 
50 55 60 

His Asp Arg Phe Gly Gly Tyr Ala Gin Ser Gly Leu Leu Ala Glu He 
^5 70 75 80 

Thr Pro Asp Lys Ala Phe Gin Asp Lys Leu Tyr Pro Phe Thr Trp Asp 

85 90 95 ^ 

Ala Val Arg Tyr Asn Gly Lys Leu He Ala Tyr Pro He Ala Val Glu 

100 105 110 

Ala Leu Ser Leu He Tyr Asn Lys Asp Leu Leu Pro Asn Pro Pro Lys 
115 120 125 

Thr Trp Glu Glu He Pro Ala Leu Asp Lys Glu Leu Lys Ala Lys Gly 
130 135 140 

Lys Ser Ala Leu Met Phe Asn Leu Gin Glu Pro Tyr Phe Thr Trp Pro 
145 150 155 160 

Leu He Ala Ala Asp Gly Gly Tyr Ala Phe Lys Tyr Glu Asn Gly Lys 

165 170 175 

Tyr Asp He Lys Asp Val Gly Val Asp Asn Ala Gly Ala Lys Ala Gly 

180 185 190 

Leu Thr Phe Leu Val Asp Leu He Lys Asn Lys His Met Asn Ala Asp 
195 200 205 

Thr Asp Tyr Ser He Ala Glu Ala Ala Phe Asn Lys Gly Glu Thr Ala 
210 215 220 

Met Thr He Asn Gly Pro Trp Ala Trp Ser Asn He Asp Thr Ser Lys 
225 230 235 240 

Val Asn Tyr Gly Val Thr Val Leu Pro Thr Phe Lys Gly Gin Pro Ser 

245 250 255 

Lys Pro Phe Val Gly Val Leu Ser Ala Gly He Asn Ala Ala Ser Pro 

260 265 270 

Asn Lys Glu Leu Ala Lys Glu Phe Leu Glu Asn Tyr Leu Leu Thr Asp 
275 280 285 

Glu Gly Leu Glu Ala Val Asn Lys Asp Lys Pro Leu Gly Ala Val Ala 
290 295 300 

Leu Lys Ser Tyr Glu Glu Glu Leu Ala Lys Asp Pro Arg He Ala Ala 
305 310 315 320 

Thr Met Glu Asn Ala Gin Lys Gly Glu He Met Pro Asn He Pro Gin 

325 330 335 

Met Ser Ala Phe Trp Tyr Ala Val Arg Thr Ala Val He Asn Ala Ala 

340 345 350 
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Ser Gly Arg Gin Thr Val Asp Glu Ala Leu Lys Asp Ala Gin Thr Asn 
355 360 365 

Ser Ser Ser Val Pro Gly Arg Gly Ser He Glu Gly Arg He Ala Glv 
370 375 380 

Pro Glu Trp Leu Leu Asp Ala Pro Ser Val Asn Asn Ser Gin Leu Val 
385 390 395 400 

Val Ser Val Ala Gly Thr Val Glu Gly Thr Asn Gin Asp He Ser Leu 

405 410 415 

Ala Phe Phe Glu He Asp Leu Thr Ser Ala Pro Ala His Gly Gly Ala 

420 425 430 

Thr Glu Gin Gly Leu Ser Pro Ala Ser Lys Pro Phe Ala Thr Asp Ser 
435 440 445 

Gly Ala Met Ser His Lys Leu Glu Lys Ala Asp Leu Leu Lys Ala He 
450 455 460 

Gin Glu Gin Leu He Ala Asn Val His Ser Asn Asp Asp Tyr Phe Glu 
465 470 475 480 

Val He Asp Phe Ala Ser Asp Ala Thr He Thr Asp Arg Asn Gly Lys 

485 490 495 

Val Tyr Phe Ala Asp Lys Asp Gly Ser Val Thr Leu Pro Thr Gin Pro 

500 505 510 

Val Gin Glu Phe Leu Leu Ser Gly His Val Arg Val Arg Tyr Lys Glu 
515 520 525 

Lys Pro He Gin Asn Gin Ala Lys Ser Val Asp Val Glu Tyr Thr Val 
530 535 540 

Gin Phe Thr Pro Leu Asn Pro Asp Asp Asp Phe Arg Pro Gly Leu Lys 
545 550 555 560 

Asp Thr Lys Leu Leu Lys Thr Leu Ala He Gly Asp Thr He Thr Ser 

565 570 575 

Gin Glu Leu Leu Ala Gin Ala Gin Ser He Leu Asn Lys Asn His Pro 

580 585 590 

Gly Tyr Thr He Tyr Glu Arg Asp Ser Ser He Val Thr His Asp Asn 
595 600 605 

Asp He Phe Arg Thr He Leu Pro Met Asp Gin Glu Phe Thr Tyr Ara 
610 615 620 

Val Lys Asn Arg Glu Gin Ala Tyr Arg He Asn Lys Lys Ser Gly Leu 
625 630 635 640 

Asn Glu Glu He Asn Asn Thr Asp Leu He Ser Glu Lys Tyr Tyr Val 

645 650 



Leu Lys Lys Gly Glu Lys Pro Tyr Asp Pro Phe Asp Arg Ser His Leu 

660 665 670 

Lys Leu Phe Thr He Lys Tyr Val Asp Val Asp Thr Asn Glu Leu Leu 
675 680 685 
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Lys Ser Glu Gin Leu Leu Thr Ala Ser Glu Arg Asn Leu Asp Phe Aro 
690 695 700 

Asp Leu Tyr Asp Pro Arg Asp Lys Ala Lys Leu Leu Tyr Asn Asn Leu 
705 710 715 720 

Asp Ala Phe Gly lie Met Asp Tyr Thr Leu Thr Gly Lys Val Glu Asp 

725 730 735 

Asn His Asp Asp Thr Asn Arg lie He Thr Val Tyr Met Gly Lys Arc 

740 745 750 

Pro Glu Gly Glu Asn Ala Ser Tyr His Leu Ala Tyr Asp Ala Asp Arg 
755 760 765 

Tyr Thr Glu Glu Glu Arg Glu Val Tyr Ser Tyr Leu Arg Tyr Thr Gly 
770 775 780 

Thr Pro He Pro Asp Asn Pro Asn Asp Lys Asn Asn Ser Gin Leu Val 
■^SS 790 795 800 

Val Ser Val Ala Gly Thr Val Glu Gly Thr Asn Gin Asp He Ser Leu 

805 810 815 

Lys Phe Phe Glu He Asp Leu Thr Ser Arg Pro Ala His Gly Gly Lys 

820 825 830 

Thr Glu Gin Gly Leu Ser Pro Lys Ser Lys Pro Phe Ala Thr Asp Ser 
835 840 845 

Gly Ala Met Ser His Lys Leu Glu Lys Ala Asp Leu Leu Lys Ala He 
850 855 860 

Gin Glu Gin Leu He Ala Asn Val His Ser Asn Asp Asp Tyr Phe Glu 
865 870 875 880 

Val He Asp Phe Ala Ser Asp Ala Thr He Thr Asp Arg Asn Gly Lys 

885 890 895 

Val Tyr Phe Ala Asp Lys Asp Gly Ser Val Thr Leu Pro Thr Gin Pro 

900 905 910 

Val Gin Glu Phe Leu Leu Ser Gly His Val Arg Val Arg Tyr Lys Glu 
915 920 925 

Lys Pro He Gin Asn Gin Ala Lys Ser Val Asp Val Glu Tyr Thr Val 
930 935 940 

Gin Phe Thr Pro Leu Asn Pro Asp Asp Asp Phe Arg Pro Gly Leu Lvs 
945 950 955 960 

Asp Thr Lys Leu Leu Lys Thr Leu Ala He Gly Asp Thr He Thr Ser 

965 970 975 

Gin Glu Leu Leu Ala Gin Ala Gin Ser He Leu Asn Lys Asn His Pro 

980 985 990 

Gly Tyr Thr He Tyr Glu Arg Asp Ser Ser He Val Thr His Asp Asn 
995 1000 1005 

Asp He Phe Arg Thr He Leu Pro Met Asp Gin Glu Phe Thr Tyr Aro 
1010 1015 1020 
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Val Lys Asn Arg Glu Gin Ala Tyr Arg He Asn Lys Lys Ser Gly Leu 
1025 1030 1035 1040 

Asn Glu Glu He Asn Asn Thr Asp Leu He Ser Glu Lys Tyr Tyr Val 

1045 1050 1055 

Leu Lys Lys Gly Glu Lys Pro Tyr Asp Pro Phe Asp Arg Ser His Leu 

1060 1065 1070 

Lys Leu Phe Thr He Lys Tyr Val Asp Val Asp Thr Asn Glu Leu Leu 
1075 1080 1085 

Lys Ser Glu Gin Leu Leu Thr Ala Ser Glu Arg Asn Leu Asp Phe Ara 
1090 1095 1100 

Asp Leu Tyr Asp Pro Arg Asp Lys Ala Lys Leu Leu Tyr Asn Asn Leu 
1105 1110 1115 1120 

Asp Ala Phe Gly He Met Asp Tyr Thr Leu Thr Gly Lys Val Glu Asp 

1125 1130 1135 

Asn His Asp Asp Thr Asn Arg He He Thr Val Tyr Met Gly Lys Arg 

1140 1145 1150 

Pro Glu Gly Glu Asn Ala Ser Tyr His Leu Ala Tyr Asp Lys Asp Arg 
1155 1160 1165 

Tyr Thr Glu Glu Glu Arg Glu Val Tyr Ser Tyr Leu Arg Tyr Thr Gly 
1170 1175 1180 

Thr Pro He Pro Asp Asn Pro Asn Asp Lys 
1185 1190 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2566 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CTGCAGCTAC CTGATACCA6 GCATTTCCAA CAAACATGGT TAAGGCCAAA CCAAAATCAC 60 
TTTCTAGCGT TGGCAAGAGA CCTTCAAGCG AGCGCAAGAC CTTTATTGAA GTTGCTTGTC 120 
GACATAAAAA TGCTGTTTG6 GTTGTGCTGA TA6GCAAAAT GACCTCAAGC CCTGCAATCA 180 
TCTGCTG6AG CAACTCAACT AAGTCAGCTG GTAAAACCTG CTGATGATTG AGGTAAATAA 240 
ACTGAGAAGT CTCAAACAGC TGAGGGGGAT TGCCCTGATG ATCAAGCAAA TACCGCTGCC 300 
AAGGTGACCC TAGCGGCTGC AAGACCTCAT ATTGACCCAA CCCCACCTCA AGTAATAAGC 360 
GCTCTTTTTC GGATAAACAT GATTTGGGAA AATGCACATA TTGGTCCCCT TCTTTGACAC 420 
TCACCCACTC TTTATCTCCT AACGGATGAG 6GCCTACTTG CATCTCTGGA AAATAGTCTT 480 
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TTAGCTCCAT 


AGCCATTCCT 


TTCATGACGG 


TCTTTAAACC 

X %^x X xfu^nw^ 


> AX X AXAAVi#AW 


' AlGACTwTTT 


540 


ATCACACA6T 


TCAGTTTGTT 


GTCAGCACGA 


TTTTGTAfPTT 

A X X X^7 Xc^X X X 


' TCTGPPTTTT 
XwXS9WWX XXX 


1 aATCATTAA 


600 


AACTAAATAA 


GGGTTATTCA 


TTTTTAGCAA 


GAACATTCAA 


TTAAATAGP'T 

XXAAAXAVwX 


AX X X ATwGGa 


f f A 
660 


ATATTAATTT 


ATGTTTATGC 


TAAAAAAGGT 


ATTATTTACC 


X X X X X X WAX X 


wXW*aX XaAAA 


TO A 

720 


TATCATTTTA 


AAAAAATCAA 


TAGGTTTTTA 


TTTGTGTCTT 


TAAAACCATT 


ATGTTATTCT 


780 


AATAATGG66 


ATTGAAACTT 


AACTTTTAGG 


AGGTTTCTAT 


GAAAAATTAC 


TTATCTTTTG 


840 


GGAT6TTTGC 


ACTGCT6TTT 


GCACTAACAT 


TTG6AACA6T 


CAATTCTGTC 


CAAGCTATTG 


900 


CTGGACCTGA 


GTGGCTGCTA 


GACCGTCCAT 


CT6TCAACAA 


CAGCCAATTA 


GTTGTTAGCG 


960 


TTGCTGGTAC 


TGTTGAGGGG 


ACGAATCAAC 


AOATTJVGTPT 
nV«A X X AV7 X Vir X 


TAAATTTTTT 


GAAATCGATC 


1020 


TAACATCACG 


ACCTGCTCAT 


AGGAAA6ACA 


GAGCAAGRPT 


TAAGTCCAAA ATCAAAACCA 


1080 


TTTGCTACTG 


ATAGTGGCGC 


GATGTCACAT 


AAAPTTGAGA 


AAGCTGACTT 


ACTAAAGGCT 


1140 


ATTCAAGAAC 


AATTGATCGC 


TAACGTCCAC 


AGTAACGACG 


ACTACTTTGA 


GGTCATT6AT 


1200 


TTTGCAAGCG 

• 


ATGCAACCAT 


TACT6ATCGA 


AACGGCAAGG 


TCTACTTTGC 


TGACAAA6AT 


1260 


GGTTCGGTAA 


CCTTGCCGAC 


CCAACCTGTC 


CAAGAATTTT 

wXmm7^U^X XXX 


TGCTAAGCGG 


ACATGTGCGC 


1320 


GTTAGACCAT 


ATAAAGAAAA 


ACCAATACAA 


AACP A AGr*G A 


AATCTGTTGA 


TGTGGAATAT 

X^VX W«%AX<%X 


1380 


ACTGTACAGT 


TTACTCCCTT 


AAACCCT6AT 


G A PG A TTTPli 
wAWVAX X X 


GACCAGGTCT 


CAAAGATACT 


1440 


AAGCTATTGA 


AAACACTAGC 


TATCGGTGAP 


A^ WA X AVif A X 


CTCAAGAATT 


ACTAGCTCAA 


1500 


GCACAAA6CA 


TTTTAAACAA 




U\jr V X A X A\.»i7 A 


TTTATGAACG 


TGACTCCTCA 


1560 


ATCGTCACTC 


ATGACAATGA 




AwwrAX X X X AW 


CAATGGATCA 


AGAGTTTACT 


1620 


TACCGTGTTA 


AAAATCGGGA 


ACAAGCTTAT 


AGGATPAATA 

AWUr A X W AA X A 


AAAAATCTGG 


TCTGAATGAA 


1680 


GAAATAAACA 


ACACTGACCT 


GATCTCTGAG 


AAATATTAPG 

AXlbAX AX X AV^O 


TCCTTAAAAA 


AGGGGAAAAG 


1740 


CCGTATGATC 


CCTTTGATCG 


CAGTCACTTG 


AA APTGTTPa 

AAAWXVarX XWA 


CCATCAAATA 


CGTTGATGTC 


1800 


GATACCAACG 


AATTGCTAAA 


AAGTG AG C AG 


PTPTTAAPRG 

w X W X X AA W AV? 


CTAGCGAACG 


TAACTTAGAC 


1860 


TTCAGAGATT 


TATACGATCC 


TCGTGATAAG 


GCTAAACTAP 

S9WXAAAWX AW 


TCTACAACAA 


TCTCGATGCT 


1920 


TTTGGTATTA 


TGGACTATAC 


CTTAACTGGA 


AA AGTAG&r^n 

AAAV9 X AV9 A\7 V7 


ATAATCACGA 


TGACACCAAC 


1980 


CGTATCATAA 


CCGTTTATAT 


GGGHAAGPGA 


WWwVvAAVvwAVar 


AGAATGCTAG 


CTATCATTTA 


2040 


GCCTATGATA 


AAGATCGTTA 


TACCGAAGAA 


GAACGAGAAG 


TTTACAGCTA 


CCTGCGTTAT 


2100 


ACAGGGACAC 


CTATACCTGA 


TAACCCTAAC 


GACAAATAAC 


CACGGTCTTC 


TAAAACGATG 


2160 


AGATTAACTG 


ACAAA2UUUVG 


CAAGCAACAT 


GCTATCAACA 


GTTGCTTGCT 


TTTTTCTAAC 


2220 


CTCTTA6TTG 


TAGAGACTAG 


TGACATTTCG 


TGTCTAA7UVT 


AATCGT7UVCT 


GGTCCATCAT 


2280 


TGATGAGACT 


AACCTGCATA 


TCTGCCCCAA 


AAACGCCACG 


CTCAACTGGC 


ACAAAATCTG 


2340 


CCAATTGTTC 


ATTAAAGC6A 


TCATAAAACT 


GGCTAGCCAT 


ATCAGCTTTG 


CAGCTCCTGT 


2400 
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AAAGGCTGGG CGATTTCCCT TTTTGGTGTC AGCATAAAGG GTAAATTGCG ACACAGATAA 
GATACTACCC TTGATGTCTT GGATAGACTG ATTCATCTTG CCATCAGCAT CTGAAAAAAT 
GCGCATGTTG ACTATTTTTG CACAGCGTAA GCCAAATCTT CTGCAG 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1143 base pairs 

(B) 'TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



2460 
2520 
2566 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ATGAAAACTG AA6AAGGTAA ACTGGTAATC TGGATTAACG 6CGATAAA66 CTATAACGGT 60 

CTCGCTGAAG TCGGTAAGAA ATTCGAGAAA GATACCGGAA TTAAAGTCAC CGTTGA6CAT 120 

CCGGATAAAC TGGAAGAGAA ATTCCCACAG GTTGCGGCAA CTGGCGATGG CCCTGACATT 180 

ATCTTCTGGG CACACGACCG CTTTGGTGGC TACGCTCAAT CTG6CCTGTT GGCTGAAATC 240 

ACCCCGGACA AAGCGTTCCA GGACAAGCTG TATCCGTTTA CCTGGGATGC CGTAC6TTAC 300 

AACGGCAA6C TGATTGCTTA CCCGATCGCT GTTGAA6CGT TATCGCTGAT TTATAACAAA 360 

GATCTGCTGC CGAACCCGCC AAAAACCTGG GAA6AGATCC CGGOGCTGGA TAAAGAACTG 420 

AAAGCGAAA6 6TAAGA6CGC GCTGATGTTC AACCTGCAAG AACCGTACTT CACCTGGCCG 480 

CT6ATTGCTG CTGACGGGGG TTATGCGTTC AAGTATGAAA ACGGCAAGTA CGACATTAAA 540 

GAC6TGGGC6 TGGATAACGC TGGCGC6AAA 6CGGGTCTGA CCTTCCTGGT TGACCTGATT 600 

AAAAACAAAC ACATGAATGC A6ACACC6AT TACTCCATCG CAGAAGCTGC CTTTAATAAA 660 

6GCGAAACAG CGAT6ACCAT CAACGGCCCG TGGGCATGGT CCAACATCGA CACCAGCAAA 720 

GTGAATTATG GTGTAACGGT ACTGCCGACC TTCAAGGGTC AACCATCCAA ACCGTTCGTT 780 

GGCGTGCTGA GCGCAGGTAT TAACGCCGCC AGTCCGAACA AAGAGCTGGC GAAAGAGTTC 840 

CTCGAAAACT ATCTGCTGAC TGATGAAGGT CTGGAAGCGG TTAATAAAGA CAAACCGCTG 900 

GGTGCC6TAG C6CTGAAGTC TTACGAGGAA GA6TTGGCGA AAGATCCACG TATTGCCGCC 960 

ACCATGGAAA ACGCCCAGAA A66TGAAATC ATGCCGAACA TCCCGCAGAT GTCCGCTTTC 1020 

TG6TATGCCG TGCGTACTGC GGTGATCAAC GCCGCCAGCG GTCGTCAGAC TGTCGATGAA 1080 

GCCCTGAAAG ACGCGCAGAC TAATTCGAGC TCGGTACCCG GCCGGGGATC CATCGAGGGT 1140 

AGO 3^143 
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Other embodiments are within the following claims: 
What is claimed is: 

1. A compound comprising (a) a plasminogen- 
binding fragment of streptokinase and (b) a blocking 
group at the amino-terminus of said fragment, wherein 

(i) said compound is catalytically active; 

and 

(ii) the rate of in vitro degradation of said 
compound in the presence of human plasminogen is 
at least 2 times slower compared to said rate of 
native streptokinase, wherein said rate is 
measured by the appearance of plasmin cleavage 
products as detected by immunoblotting using anti- 
streptokinase antibodies. 

2. The compound of claim 1, wherein said compound 
comprises the amino acid sequence of SEQ ID NO: 4. 

3. The compound of claim 1, wherein said blocking 
group is a heterologous peptide. 

4. The compound of claim 3, wherein said 
heterologous peptide comprises at least one heterologous 
amino acid. 



5. The compound of claim 4, wherein said 
heterologous peptide is maltose binding protein. 

6. A DNA comprising a coding sequence encoding 
the compound of claim 3. 



7. A method of dissolving blood clots in a mammal, 
comprising administering to said mammal an effective 
amount of the compound of claim 1. 
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8. A plasminogen-binding fragment of 
streptokinase , wherein 

(a) said fragment lacks between 1 and 24 
amino-terminal amino acids; 

(b) said fragment is catalytically active; 

and 

(c) the rate of In vitro degradation of said 
fragment in the presence of human plasminogen is 
at least 2 times slower compared to said rate of 
native streptokinase, wherein said rate is 
measured by the appearance of plasmin cleavage 
products as detected by immunoblotting using anti- 
streptokinase antibodies . 

9. The fragment of claim 8, wherein said fragment 
comprises at least one mutation in a potential plasmin 
cleavage site, wherein said mutation renders said 
cleavage site resistant to cleavage by plasmin. 

10. The fragment of claim 8, wherein said 
fragment consists of the amino acid sequence of (SEQ ID 
N0:4) . 

11. A DNA comprising a coding sequence encoding 
the fragment of claim 10. 

12. A polypeptide comprising a plasminogen- 
binding fragment of streptokinase, wherein 

(a) said fragment is catalytically active; 

and 

(c) the rate of in vitro degradation of said 
polypeptide is at least two times slower compared 
to said rate of native streptokinase, wherein said 
rate is measured by the appearance of plasmin 
cleavage products as detected by immunoblotting 
using anti-streptokinase antibodies. 
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ls. The polypeptide of claim 12, wherein said 
polypeptide comprises at least one mutation in a 
potential plasmin cleavage site, wherein said mutation 
renders said cleavage site resistant to cleavage by 
plasmin • 

14. The polypeptide of claim 13, wherein said 
mutation is selected from the group consisting of RlOA, 

K36A, R45A, K51A, K59A, K61A, K147A, K333, R232A, K257A, 
K298A, K309A, R234A, R363A, K386A, K372A, R388A, R394A, 
and R401A. 

15. The polypeptide of claim 14, wherein said 
polypeptide comprises RlOA, K36A, R45A, K51A and K59A 
(SEQ ID NO: 17) . 

16. The polypeptide of claim 14, wherein said 
polypeptide comprises RlOA, K36A, R45A, K51A, K59A and 
K386A (SEQ ID N0:18). 

17. A DNA comprising a coding sequence encoding 
the polypeptide of claim 14. 

18. A DNA comprising a coding sequence encoding 
the polypeptide of claim 15. 

19. A method of dissolving blood clots in a 
mammal, comprising administering to said mammal an 
effective amount of the polypeptide of claim 15. 
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